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Abstract 

We present a system for generating parsers based directly on the 
metaphor of parsing as deduction. Parsing algorithms can be represented 
directly as deduction systems, and a single deduction engine can inter- 
pret such deduction systems so as to implement the corresponding parser. 
The method generalizes easily to parsers for augmented phrase structure 
formalisms, such as definite-clause grammars and other logic grammar for- 
malisms, and has been used for rapid prototyping of parsing algorithms 
for a variety of formalisms including variants of tree-adjoining grammars, 
categorial grammars, and lexicalized context-free grammars. 



This paper is available from the Center for Research in Computing Technology, Di- 
vision of Applied Sciences, Harvard University as Technical Report TR-11-94, and 
through the Computation and Language e-print archive as cmp-lg/9404008. 



1 Introduction 



Parsing can be viewed as a deductive process that seeks to prove claims about 
the grammatical status of a string from assumptions describing the grammatical 
properties of the s tring's element s and the linear order between them. Lambck's 
syntactic calculi ( Lambek, 1958 ) comprise an early formalization of this idea, 
which more recentl y was explored in relation to grammar form a lisms based 
on definite cla uses ( Colmcrauer, 1978 ]; pereira and Warren, 1980|; Pereira and 



Warren, 1983) and on feature logics ( Shieber, 
Ramcr, 19871 ; jCarpenter, 1992|) . 



1992; Rounds and Manaster- 



The view of parsing as deduction adds two main new sources of insights and 
techniques to the study of grammar formalisms and parsing: 

1. Existing logics can be used as a basis for new grammar formalisms with 
desirable representational or computational properties. 

2. The modular separation of parsing into a logic of grammatically claims 
and a proof search procedure allows the investigation of a wide range of 
parsing algorithms for existing grammar formalisms by selecting specific 
classes of grammaticality claims and specific search procedures. 

While most of the work on deductive parsing has been concerned with (1), we 
will in this paper investigate (2), more specifically how to synthesize parsing 
algorithms by combining specific logics of grammaticality claims with a fixed 
search procedure. In this way, deduction can provide a metaphor for parsing 
that encompasses a wide range of parsing algorithms for an assortment of gram- 
matical formalisms. We flesh out this metaphor by presenting a series of parsing 
algorithms literally as inference rules, and by providing a uniform deduction en- 
gine, parameterized by such rules, that can be used to parse according to any 
of the associated algorithms. The inference rules for each logic will be repre- 
sented as unit clauses and the fixed deduction procedure, which we provide a 
Prolog implementation of, will be a version of the usual bottom-up consequence 
closure operator for definite clauses. As we will show, this method directly 
yields dynamic-programming versions of standard top-down, bottom-up, and 
mixed-direction (Earley) parsing procedures. In this, our method has similari- 
ties with the use of pure bottom-up deduction to encode dynamic-programming 



versions of definite-clause proof procedures in deductive databases (Bancilhon 
|and Ramakrishnan, 1988] ; Naughton and Ramakrishnan, 1991 



The program that we develop is especially useful for rapid prototyping of 
and experimentation with new parsing algorithms, and was in fact developed for 
that purpose. We have used it, for instance, in the development of algorithms 
for parsing with tree-adjoining grammars, categorial grammars, and lexicalized 
context-free grammars. 

Many of the ideas that we present are not new. Some have been presented 
before; others form part of the folk wisdom of the logic programming community. 



1 



However, the present work is to our knowledge the first to make the ideas 
available explicitly in a single notation and with a clean implementation. In 
addition, certain observations regarding efficient implementation may be novel 
to this work. 

The paper is organized as follows: After reviewing some basic logical and 
grammatical notions and applying them to a simple example (Section ^), wc 
describe how the structure of a variety of parsing algorithms for context-free 
grammars can be expressed as inference rules in specialized logics (Section ||). 
Then, we extend the method for stating and implementing parsing algorithms 
for formalisms other than context-free grammars (Section^). Finally, we dis- 
cuss how deduction should proceed for such logics, developing an agenda-based 
deduction procedure implemented in Prolog that manifests the presented ideas 
(Section ||). 



2 Basic Notions 

As introduced in Section we see parsing as a deductive process in which 
rules of inference are used to derive statements about the grammatical status 
of strings from other such statements. Statements are represented by formulas 
in a suitable formal language. The general form of a rule of inference is 

Ai ■ ■ ■ At. 

- (side conditions on Ai, . . . , Ak, B) 

B 

The antecedents A\, . . . , Ak and the consequent B of the inference rule are for- 
mula schemata, that is, they may contain syntactic metavariables to be instan- 
tiated by appropriate terms when the rule is used. A grammatical deduction 
system is defined by a set of rules of inference and a set of axioms given by 
appropriate formula schemata. 

Given a grammatical deduction system, a derivation of a formula B from 
assumptions Ax, ... , A m is, as usual, a sequence of formulas Si, . . . , S n such 
that B — S n , and each S 1 , is either an axiom (one of the Aj) or there is a rule 
of inference R and formulas S^ , . . . , Si k with i\, . . . , iy. < i such that for appro- 
priate substitutions of terms for the metavariables in R, S^ , . . . , Si k match the 
antecedents of the rule, Si matches the consequent, and the rule's side condi- 
tions are satisfied. We write A\, . . . , A m h B and say that B is a consequence 
of A i, ... , A m if such a derivation exists. If B is a consequence of the empty set 
of assumptions, it is said to be derivable, in symbols h B. 

In our applications of this model, rules and axiom schemata may refer in 
their side conditions to the rules of a particular grammar, and formulas may 
refer to string positions in the fixed string to be parsed w — w\ ■ ■ ■ w n . With 
respect to the given string, goal formulas state that the string is grammatical 
according to the given grammar. Then parsing the string corresponds to finding 
a derivation witnessing a goal formula. 
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We will use standard notation for metavariables ranging over the objects 
under discussion: n for the length of the object language string to be parsed; 
A, B, C . . . for arbitrary formulas or symbols such as grammar nonterminals; 
a, 6, c, . . . for arbitrary terminal symbols; i, j, k, . . . for indices into various strings, 
especially the string w; a, f3, 7, . . . for strings or terminal and nonterminal sym- 
bols. We will often use such notations leaving the type of the object implicit 
in the notation chosen for it. Substrings will be notated elliptically as, e.g., 
Wi ■ ■ ■ Wj for the i-th through j-th elements of w, inclusive. As is usual, we take 
Wi ■ ■ ■ Wj to be the empty string if i > j. 



2.1 A First Example: CYK Parsing 

As a simple example, the basic mechanism of the Cocke- Younger-Kasami (CYK) 



context-free parsing algorithm (Kasami, 1965; Younger, 1967) for a context-free 



grammar in Chomsky normal form can be easily represented as a grammatical 
deduction system. 

We assume that we are given a string w — W\ ■ ■ ■ w n to be parsed and a 
context-free grammar G = (N, E, P, S) , where N is the set of nonterminals 
including the start symbol S, E is the set of terminal symbols, (V = N U E is 
the vocabulary of the grammar,) and P is the set of productions, each of the 
form A — ► a for A S N and a 6 V*. We will use the symbol =£- for immediate 
derivation and =>• for its reflexive, transitive closure, the derivation relation. In 
the case of a Chomsky-normal-form grammar, all productions are of the form 
A -> B C or A -> a. 

The items of the logic (as we will call parsing logic formulas from now on) 
are of the form L4,i, j], and state that the nonterminal A derives the substring 
between indices i and j in the string, that is, A => Wi+i ■ ■ - Wj. Sound axioms, 
then, are grounded in the lexical items that occur in the string. For each 
word Wi+i in the string and each rule A — > lUj+i, it is clear that the item 
[A, i, i + 1] makes a true claim, so that such items can be taken as axiomatic. 
Then whenever we know that B ifj+i • • • Wj and C => Wj+i • • • Wk — as 
asserted by items of the form [B,i,j] and [C,j,k] — where A — > B C is a 
production in the grammar, it is sound to conclude that C Wi+x • • * Wk, and 
therefore, the item [C, i, k] should be inferable. This argument can be codified 
in a rule of inference: 

[B,i,j] [C,j,k] A ^ BC 
[A, i, k] 

Using this rule of inference with the axioms, we can conclude that the string is 
admitted by the grammar if an item of the form [S, 0, n] is deducible, since such 
an item asserts that S => w\ ■ ■ ■ w n — w. We think of this item as the goal item 
to be proved. 

In summary, the CYK deduction system (and all the deductive parsing sys- 
tems we will define) can be specified with four components: a class of items; a 
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Item form: 



[Ahj] 



Axioms: [A, i, i + 1] A — > w^+i 

Goals: [£?, 0, n] 

Inference rules: ^'"'^ . A^BC 

[A, i, k] 



Figure 1: The CYK deductive parsing system. 



set of axioms; a set of inference rules; and a subclass of items, the goal items. 
These are given in summary form in Figure |l|. 

This deduction system can be encoded straightforwardly by the following 
logic program: 

nt(A, II, I) :- 

word(I , W) , 

(A — > [W]), 

II is I - 1. 
nt(A, I, K) :- 

nt(B, I, J), 

nt(C, J, K) , 

(A — > [B, C]) . 

where A > [X\,. . . ,Y m ] is the encoding of a production A — > X\ ■ ■ ■ X n 

in the grammar and word(i ,Wi) holds for each input word Wi in the string to 
be parsed. A suitable bottom-up execution of this program, for example using 



the semi-naive bottom-up procedure (Naughton and Ramakrishnan, 1991) will 



behave similarly to the CYK algorithm on the given grammar. 



2.2 Proofs of Correctness 

Rather than implement each deductive system like the CYK one as a separate 
logic program, we will describe in Section |5| a meta- interpreter for logic programs 
obtained from grammatical deduction systems. The meta-interpreter is just 
a variant of the semi-naive procedure specialized to programs implementing 
grammatical deduction systems. We will show in Section ||| that our procedure 
generates only items derivable from the axioms (soundness) and will enumerate 
all the derivable items (completeness). Therefore, to show that a particular 
parsing algorithm is correctly simulated by our meta-interpreter, we basically 
need to show that the corresponding grammatical deduction system is also sound 
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and complete with respect to the intended interpretation of grammaticality 
items. By sound here we mean that every derivable item represents a true 
grammatical statement under the intended interpretation, and by complete we 
mean that the item encoding every true grammatical statement is derivable. 
(We also need to show that the grammatical deduction system is faithfully 
represented by the corresponding logic program, but in general this will be 
obvious by inspection.) 

3 Deductive Parsing of Context-Free Grammars 

We begin the presentation of parsing methods stated as deduction systems with 
several standard methods for parsing context-free grammars. In what follows, 
we assume that we are given a string w = Wi • ■ • w n to be parsed along with a 
context-free grammar G = (N, S, P, S). 

3.1 Pure Top-Down Parsing (Recursive Descent) 

The first full parsing algorithm for arbitrary context-free grammars that we 
present from this logical perspective is recursive-descent parsing. Given a context- 
free grammar G = (N, S, P, S), and a string w = Wi ■ ■ ■ w n to be parsed, we 
will consider a logic with items of the form [ • j3, j] where < j < n. Such 
an item asserts that the substring of the string w up to and including the j-th 
element, when followed by the string of symbols (3, forms a sentential form of 
the language, that is, that S w\ ■ ■ ■ Wj(3. Note that the dot in the item is 
positioned just at the break point in the sentential form between the portion 
that has been recognized (up through index j) and the part that has not (/3). 

Taking the set of such items to be the [propositional] formulas of the logic, 
and taking the informal statement concluding the previous paragraph to provide 
a denotation for the sentences]^ we can explore a proof theory for the logic. We 
start with an axiom 



which is sound because S S trivially. 

Note that two items of the form [ • wj+ifi, j) and [ • /3, j + 1] make the same 
claim, namely that S w\ ■ ■ ■ WjWj+ifl. Thus, it is clearly sound to conclude 
the latter from the former, yielding the inference rule: 



[•S,0] 



[•w j+1 f3,j] 
[•PJ + l] 



1 A more formal statement of the semantics could be given, e.g., as 




truth if S => u>i ■ ■ ■ Wjj3 
falsity otherwise 
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Item form: 



[•0,3] 



Axioms: [ • S, 0] 

Goals: f • , n] 



Inference rules: 
Scanning 



'•w j+1 p,j] 
[•0,3 + 1] 



Prediction B -» 7 

[•10,31 

Figure 2: The top-down recursive-descent deductive parsing system. 

which we will call the scanning rule. 

A similar argument shows the soundness of the prediction rule: 

I'*™ B^, . 



'10, j] 



Finally, the item [• , n] makes the claim that S => w\ ■ ■ ■ w n , that is, that the 
string w is admitted by the grammar. Thus, if this goal item can be proved from 
the axiom by the inference rules, then the string must be in the grammar. Such 
a proof process would constitute a sound recognition algorithm. As it turns out, 
the recognition algorithm that this logic of items specifies is a pure top-down 
left-to-right regime, a recursive-descent algorithm. The four components of the 
deduction system for top-down parsing — class of items, axioms, inference rules, 
and goal items — are summarized in Figure [|. 

To illustrate the operation of these inference rules for context-free parsing, 
we will use the toy grammar of Figure |^. Given that grammar and the string 

W1W2W3 — a program halts (1) 



G 



s - 


-> NP VP 


Bet - 


-» a 


NP - 


-> Bet N OptRel 


N - 


-> program 


NP - 


4 PN 


PN - 


-> Terry 


VP - 


4 TV NP 


PN - 


-> Shrdlu 


VP - 


+ IV 


IV - 


■+ halts 


OptRel - 


* RelPro VP 


TV - 


■> writes 


OptRel - 


■> e 


RelPro - 


-» that 



Figure 3: An example context-free grammar, 
we can construct the following derivation using the rules just given: 



1 


[•S,0] 


AXIOM 


2 


[ • NP VP, 0] 


PREDICT from 


3 


[•BetN OptRel VP, 0] 


PREDICT from 


4 


[ • a N OptRel VP, 0] 


PREDICT from 


5 


[ • N OptRel VP, 1] 


SCAN from 4 


6 


[ • program OptRel VP, 1] 


PREDICT from 


7 


[ • OptRel VP, 2] 


SCAN from 6 


8 


[• VP, 2] 


PREDICT from 


9 


[•IV,2] 


PREDICT from 


10 


[ • halts, 2] 


PREDICT from 


11 


[•,3] 


SCAN from 10 



The last item is a goal item, showing that the given sentence is accepted by the 
grammar of Figure ^. 

The above derivation, as all the others we will show, contains just those items 
that are strictly necessary to derive a goal item from the axiom. In general, a 
complete search procedure, such as the one we describe in Section B, generates 
items that are either dead-ends or redundant for a proof of grammaticality. Fur- 
thermore, with an ambiguous grammar there will be several essentially different 
proofs of grammaticality, each corresponding to a different analysis of the input 
string. 

3.1.1 Proof of Completeness 

We have shown informally above that the inference rules for top-down parsing 
are sound, but for any such system we also need the guarantee of completeness: 
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if a string is admitted by the grammar, then for that string there is a derivation 
of a goal item from the initial item. 

In order to prove completeness, we prove the following lemma: If S 
w\ ■ ■ -Wj-f is a leftmost derivation (where 7 E V*), then the item [ • 7, j] is 
generated. We must prove all possible instances of this lemma. Any specific 
instance can be characterized by specifying the string 7 and the integer j, since 
S and Wi • ■ ■ Wj are fixed. We shall denote such an instance by (7, j). The proof 
will turn on ranking the various instances and proving the result by induction 
on the rank. The rank of the instance (7, j) is computed as the sum of j and 
the length of a shortest leftmost derivation of S =>■ ■ ■ ■ Wj^. 

If the rank is zero, then j = and 7 = 6'. Then, we need to show that [• S, 0] 
is generated, which is the case since it is an axiom of the top-down deduction 
system. 

For the inductive step, let (7, j) be an instance of the lemma of some rank 
r > 0, and assume that the lemma is true for all instances of smaller rank. Two 
cases arise. 

Case 1: S => wi ■ ■ ■ Wj"f in one step. Therefore, S — > w\ ■ ■ ■ Wj"/ is a rule of the 
grammar. However, since [ • S, 0] is an axiom, by one application of the 
prediction rule (predicting the rule S — ► Wi ■ ■ ■ Wjj) and j applications of 
the scanning rule, the item [ • 7, j] will be generated. 

Case 2: S 4> w\---Wj^ in more than one step. Let us assume therefore 
that S 4> wi---Wj-kB"f' wi ■ ■ ■ Wj fl"f' where 7 = ^7' and B — ► 
Wj-k+i • ■ ■ Wjf3. The instance (S7', j — k) has a strictly smaller rank than 
(7, j). Therefore, by the induction hypothesis, the item [• Bj',j — k] will 
be generated. But then, by prediction, the item [ • Wj^k+i • ■ • Wj(3,j — k] 
will be generated and by k applications of the scanning rule, the item 
[•B,j] will be generated. 

This concludes the proof of the lemma. Completeness of the parser follows 
as a corollary of the lemma since if S => wi ■ • ■ w n , then by the lemma the item 
[•, n] will be generated. 

Completeness proofs for the remaining parsing logics discussed in this paper 
could be provided in a similar way by relating an appropriate notion of normal- 
form derivation for the grammar formalism under consideration to the item 
invariants. 

3.2 Pure Bottom-Up Parsing (Shift-Reduce) 

A pure bottom-up algorithm can be specified by such a deduction system as 
well. Here, the items will have the form [a • , j]. Such an item asserts the dual 
of the assertion made by the top-down items, that awj + \ ■ ■ ■ w n 4> Wi ■ ■ ■ w n (or, 
equivalently but less transparently dual, that a => wi ■ ■ ■ Wj). The algorithm is 
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Item form: 



[" • . 3] 



Axioms: [ • , 0] 

Goals: [S • ,n] 
Inference Rules: 

shift [a *> j] 



[awj+i • , j + 1] 



Reduce ^Z'^l B -> 7 



Figure 4: The bottom-up shift-reduce deductive parsing system. 



then characterized by the deduction system shown in Figure |4|. The algorithm 
mimics the operation of a nondeterministic shift-reduce parsing mechanism, 
where the string of symbols preceding the dot corresponds to the current parse 
stack, and the substring starting at the index j corresponds to the as yet unread 
input. 

The soundness of the inference rules in Figure [| is easy to see. The an- 
tecedent of the shift rule claims that awj+i ■ ■ ■ w n =>- w\ ■ ■ ■ w n , but that is also 
what the consequent claims. For the reduce rule, if cpyuij+i ■ ■ ■ w n =$> wi ■ ■ ■ w n 
and B — > 7, then by definition of => wc also have aBwj+i ■ ■ ■ w n => W\ • ■ ■ w n . 
As for completeness, it can be proved by induction on the steps of a reversed 
rightmost context-free derivation in a way very similar to the completeness proof 
of the last section. 

The following derivation shows the operation of the bottom-up rules on 
example sentence (|l|): 



1 


[•A 


AXIOM 


2 


[a.,1] 


shift from 1 


3 


[Det»,l] 


REDUCE from 2 


4 


[Det program • , 2] 


SHIFT from 3 


5 


[Det N • , 2] 


REDUCE from 4 


6 


[Det N OptRel • , 2] 


REDUCE from 5 


7 


[NP»,2] 


REDUCE from 6 


8 


[NP halts • , 3] 


shift from 7 


9 


[NPIV»,3] 


REDUCE from 8 


10 


[NP VP • , 3] 


REDUCE from 9 


11 


[5., 3] 


REDUCE from 10 



The last item is a goal item, which shows that the sentence is parsable according 
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to the grammar. 



3.3 Earley's Algorithm 

Stating the algorithms in this way points up the duality of recursive-descent 
and shift-reduce parsing in a way that traditional presentations do not. The 
summary presentation in Figure [5] may further illuminate the various interrela- 
tionships. As we will see, Earley's algorithm ( Ear ley, 197C| ) can then be seen as 



the natural combination of these two algorithms. 

In recursive-descent parsing, we keep a partial sentential form for the mate- 
rial yet to be parsed, using the dot at the beginning of the string of symbols to 
remind us that these symbols come after the point that we have reached in the 
recognition process. In shift-reduce parsing, we keep a partial sentential form for 
the material that has already been parsed, placing a dot at the end of the string 
to remind us that these symbols come before the point that we have reached 
in the recognition process. In Earley's algorithm we keep both of these partial 
sentential forms, with the dot marking the point somewhere in the middle where 
recognition has reached. The dot thus changes from a mnemonic to a necessary 
role. In addition, Earley's algorithm localizes the piece of sentential form that is 
being tracked to that introduced by a single production. (Because the first two 
parsers do not limit the information stored in an item to only local information, 
they are not practical algorithms as stated. Rather some scheme for sharing the 
information among items would be necessary to make them tractable.) 

The items of Earley's algorithm are thus of the form [i, A — > a • f3,j] where 
a and f3 are strings in V* and A — > a/3 is a production of the grammar. As was 
the case for the previous two algorithms, the j index provides the position in the 
string that recognition has reached, and the dot position marks that point in 
the partial sentential form. In these items, however, an extra index i marks the 
starting position of the partial sentential form, as we have localized attention 
to a single production. In summary, an item of the form [i,A — > a • /3,j] 
makes the top-down claim that S =£- Wi • ■ • WiA-y, and the bottom-up claim that 
awj+i ■ ■ -w n =4> ifj+i • ■ • w n . The two claims are connected by the fact that 
A — * a(3 is a production in the grammar. 

The algorithm itself is captured by the specification found in Figure [5]. Proofs 
of soundness and completeness are somewhat more complex than those for the 
pure top-down and bottom-up cases shown above, and are directly related to 



the corresponding proofs for Earley's original algorithm (Earley, 197C). 

The following derivation, again for sentence ([!]), illustrates the operation of 
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Algorithm 


Bottom- Up 


Top-Down 


Earley 's 


Item form 




[•0,3] 


[i,A^a»0,j] 


Invariant 


* 

awj+i • • • w n w\ • • • w n 


S =>■ wi • • ■ uij/3 


S 4> wi • • ■ -0^7 
awj+i • • • u>„ => u>i + i • • • w n 


Axioms 


[•,0] 


[.5,0] 


[0, S' -» .5, 0] 


Goals 


[£•,71] 


[»,n] 


[0, /S — ► i> • , n] 


Scanning 




[•w j+ i/3,j] 


[i, A — ► a • iOj+i/9, j] 


K'+i • ,j + 1] 


[•/3,J + 1] 


[i, A aioj+i •0,j+ 1] 


Prediction 




[•7/U 7 




Completion 


[«7*,j] p . .. 




[M-»a.B/?,fc] [fc.fl- 7 •,.?'] 


[«s-,j] 7 





the Ear ley inference rules: 



1 


[0 


S' - 


» »s,o] 


AXIOM 




2 


[0 


S -> 


• NP VP, 0] 


PREDICT from 1 




3 


[0 


NP- 


-> •DetN OptRel, 0] 


predict from 2 




4 


[0 


Det 


-»• «a,0] 


predict from 3 




5 


[o 


Det 


-> a« , 1] 


SCAN from 4 




6 


[o 


NP- 


-> Det»N OptRel, 1] 


COMPLETE from 3 


and 5 


7 


[1 


N - 


► • program, 1] 


predict from 6 




8 


[1 


N - 


> program • , 2] 


SCAN from 7 




9 


[o 


NP- 


-* DetN* OptRel, 2] 


COMPLETE from 6 


and 8 


10 


[2 


OptRel^ »,2] 


predict from 9 




11 


[o 


NP- 


-> L»e< N OptRel • , 2] 


COMPLETE from 9 


and 10 


12 


[o 




NP»VP,2] 


COMPLETE from 2 


and 11 


13 


[2 


VP - 


^ .JV.2] 


PREDICT from 12 




14 


[2 


IV- 


■> • halts, 2] 


predict from 13 




15 


[2 


IV- 


■+ halts • , 3] 


SCAN from 14 




1G 


[2 


VP 


-> JF«,3] 


COMPLETE from 13 and 15 


17 


[0 


S -> 


VP* ,3] 


COMPLETE from 12 and 16 


18 


[0 


S' - 


► S.,3] 


COMPLETE from 1 and 17 



The last item is again a goal item, so we have an Earley derivation of the 
grammaticality of the given sentence. 



4 Deductive Parsing for Other Formalisms 

The methods (and implementation) that we developed have also been used for 
rapid prototyping and experimentation with parsing algorithms for grammatical 
frameworks other than context-free grammars. They can be naturally extended 
to handle augmented phrase-structure formalisms such as logic grammar and 
constraint-based formalisms. They have been used in the development and 
testing of algorithms for parsing categorial grammars, tree-adjoining grammars, 
and lexicalized context-free grammars. In this section, we discuss these and 
other extensions. 



4.1 Augmented Phrase-Structure Formalisms 

It is straightforward to see that the three deduction systems just presented 
can be extended to constraint-based grammar formalisms with a context-free 
backbone. The basis for this extension goes back to metamorphosis grammars 



( Colmerauer, 1978 ) and definite-clause grammars (DCG) ( Pereira and Warren, 



1980). In those formalisms, grammar symbols are first-order terms, which can 
be understood as abbreviations for the sets of all their ground instances. Then 
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an inference rule can also be seen as an abbreviation for all of its ground in- 
stances, with the metagrammatical variables in the rule consistently instantiated 
to ground terms. Computationally, however, such instances are generated lazily 
by accumulating the consistency requirements for the instantiation of inference 
rules as a conjunction of equality constraints and maintaining that conjunction 
in normal form — sets of variable substitutions — by unification. (This is di- 
rectly related to the use of unification to avoid "guessing" instances in the rules 
of existential generalization and universal instantiation in a natural-deduction 
presentation of first-order logic). 

We can move beyond first-order terms to general constraint-based grammar 
formalisms ( Shiebcr, 1992 ; |Carpenter, 1992 ) by taking the above constraint 
interpretation of inference rules as basic. More explicitly, a rule such as Earlcy 
completion 

[i,A^a*B/3,k] [k,B^ 7 *,j] 



[i,A-KxB»0,j\ 
is interpreted as shorthand for the constrained rule: 



[i,A->q» Bf3, fc] [fc,g'-> 7 .,j] 
[i,A'->aB"»p,j] 



A = A' and B — B' and B = B" 



When such a rule is applied, the three constraints it depends on are conjoined 
with the constraints for the current derivation. In the particular case of first- 
order terms and antecedent-to-consequent rule application, completion can be 
given more explicitly as 



[i,A-*a.Bl3,k] [k,B'^j*,j] 
[i,a(A —> aB • 0),j] 



a = mgu( J B, B') 



where mgu(B, B') is the most general unifier of the terms B and B 1 . This is the 
interpretation implemented by the deduction procedure described in the next 
section. 

The move to constraint-based formalisms raises termination problems in 
proof construction that did not arise in the context-free case. In the general 
case, this is inevitable, because a formalism like DCG (Pereira and Warren, 



1980) or PATR-II (Bhieber, 1985a) has Turing-machine power. However, even if 



constraints are imposed on the context-free backbone of the grammar produc- 



tions to guarantee decidability, such as offline parsability ( Bresnan and Kaplan, 



1982; Pereira and Warren, 1983; shieber, 1992), the prediction rules for the 



top-down and Earley systems are problematic. The difficulty is that prediction 
can feed on its own results to build unboundedly large items. For example, 
consider the DCG 

s -> r(0, N) 

r(X,N) -> r(s(X),N) b 
r (N, N) -> a 
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It is clear that this grammar accepts strings of the form ab n with the variable 
./V being instantiated to the unary (successor) representation of n. It is also 
clear that the bottom-up inference rules will have no difficulty in deriving the 
analysis of any input string. However, Earley prediction from the item [0, s — > 
• r(0, N), 0] will generate an infinite succession of items: 

[0,*-> •r(fl,N),0] 
[0,r(0,JV) -» •r(s{0),N) 6,0] 
[0,r(s(0),iV) -> .r(s(s(0)),iV) b, 0] 
[0,r(s(s(0)),iV) -> .r(s( S (s(0))),AT) 6,0] 

This problem can be solved in the case of the Earley inference rules by ob- 
serving that prediction is just used to narrow the number of items to be con- 
sidered by scanning and completion, by maintaining the top-down invariant 
S =r- w\ • • ■ WiA'j. But this invariant is not required for soundness or complete- 
ness, since the bottom-up invariant is sufficient to guarantee that items represent 
well- formed substrings of the input. The only purpose of the top-down invariant 
is to minimize the number of completions that are actually attempted. Thus the 
only indispensable role of prediction is to make available appropriate instances 
of the grammar productions. Therefore, any relaxation of prediction that makes 
available items of which all the items predicted by the original prediction rule are 
instances will not affect soundness or completeness of the rules. More precisely, 
it must be the case that any item [i,B — ► • 7, i] that the original prediction 
rule would create is an instance of some item [i,B' — ■> • 7',?] created by the 
relaxed prediction rule. A relaxed prediction rule will create no more items than 
the original predictor, and in fact may create far fewer. In particular, repeated 
prediction may terminate in cases like the one described above. For example, if 
the prediction rule applied to [i, A — ► a • B'j3, j] yields [i,<j(B — > • 7), i] where 
a = mgu(£>, £>'), a relaxed prediction rule might yield [i, a' (B — > • 7), i], where 
a' is a less specific substitution than a chosen so that only a finite number of 
instances of [i,B — > »7,«] are ever gene rated. A similar notion for g eneral 
constraint grammars is called restriction ( ^hicber." 1985b| ; IShicbcr, 1992j) and 



a related techniqu e has been used in partial evaluation of logic programs (tSato| 



and Tamaki, 1984|) 



The problem with the DCG above can be seen as following from the compu- 
tation of derivation-specific information in the arguments to the nonterminals. 
However, applications frequently require construction of the derivation for a 
string (or similar information), perhaps for the purpose of further processing. 
It is simple enough to augment the inference rules to include with each item a 
derivation. For the Earley deduction system, the items would include a fourth 
component whose value is a sequence of derivation trees, nodes labeled by pro- 
ductions of the grammar, one derivation tree for each element of the right-hand 
side of the item before the dot. The inference rules would be modified as shown 
in Figure a The system makes use of a function tree that takes a node label I 
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[yA-> a»w j+1 f3,j, D] 
i, A — > auijn • /3,j + 1, D 



Item form: 

Axioms: 

Goals: 

Inference rules: 
Scanning 

Prediction 

Completion 



'i, Aa • fi,j, D] 
0,5'^ -5,0,0] 
0,5' ->S»,n,D[ 



~i,A^ a • B/3, j, D] 
t, A — > a • B/3, fe, Di] 



[fc,B-> 7 *,j,£> 2 ] 



[i, A — > aB • /3, j, L»i U tree(S -> 7, D 2 )] 



Figure 6: The Ear ley deductive parsing system modified to generate derivation 
trees. 



(a production in the grammar) and a sequence of derivation trees D and forms 
a tree whose root is labeled by I and whose children are the trees in D in order. 

Of course, use of such rules makes the caching of lemmas essentially useless, 
as lemmas derived in different ways are never identical. Appropriate methods 
of implementation that circumvent this problem are discussed in Section 



5.4. 



4.2 Combinatory Categorial Grammars 



A combinatory categorial grammar (Ades and Steedman, 1982) consists of two 



parts: (1) a lexicon that maps words to sets of categories; (2) rules for combining 
categories into other categories. 

Categories are built from atomic categories and two binary operators: for- 
ward slash (/) and backward slash (\). Informally speaking, words having cate- 
gories of the form X/Y, X\Y, (W/X)/Y etc. are to be thought of as functions 
over Y's. Thus the category S\NP of intransitive verbs should be interpreted as 
a function from noun phrases (NP) to sentences (S). In addition, the direction 
of the slash (forward as in X/Y or backward as in X\Y) specifies where the 
argument must be found, immediately to the right for / or immediately to the 
left for \. 

For example, a CCG lexicon may assign the category S\NP to an intransitive 
verb (as the word sleeps). S\NP identifies the word (sleeps) as combining with a 
(subject) noun phrase (NP) to yield a sentence (S). The back slash (\) indicates 
that the subject must be found immediately to the left of the verb. The forward 
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Word Category 

John NP 

bananas NP 

likes (S\NP)/NP 

really (S\NP) / (S\NP) 

Figure 7: An example CCG lexicon. 



slash / would have indicated that the argument must be found immediately to 
the right of the verb. 

More formally categories are defined inductively as follows :0 Given a set of 
nonterminals, 

• Nonterminal symbols are categories. 

• If ci and C2 are categories, then (c\/c-i) and (c\\c-i) are categories. 

The lexicon is defined as a mapping / from words to finite sets of categories. 
Figure |?] is an example of a CCG lexicon. In this lexicon, likes is encoded as a 
transitive verb (S\NP) / NP, yielding a sentence (S) when a noun phrase (NP) 
object is found to its right and when a noun phrase subject (NP) is then found 
to its left. 

Categories can be combined by a finite set of rules that fall in two classes: 
application and composition. 

Application allows the simple combination of a function with an argument 
to its right (forward application) or to its left (backward application). For 
example, the sequence (S\NP) /NP NP can be reduced to S\NP by applying 
the forward application rule. Similarly, the sequence NP S\NP can be reduced 
to S by applying the backward application rule. 

Composition allows to combine two categories in a similar fashion as func- 
tional composition. For example, forward composition combines two categories 
of the form X/Y Y / Z to another category X / Z. The rule gives the appearance 
of "canceling" Y, as if the two categories were numerical fractions undergoing 
multiplication. This rule corresponds to the fundamental operation of "compos- 
ing" the two functions, the function X/Y from Y to X and the function Y/Z 
from Z to Y. 

The rules of composition can be specified formally as productions, but unlike 
the productions of a CFG, these productions are universal over all CCGs. In 
order to reduce the number of cases, we will use a vertical bar | as an instance of 
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The notati on for backward slash used in this paper is consistent with one denned by Ades 



and Steedman (1982): X\Y is interpreted as a function from Ys to Xs. Although this notation 
has been adopt ed by the majority of combinatory categorial grammarians, other frameworks 
( Lambek, 1958^ have adopted the opposite interpretation for X\Y: a function from Xs to Ys. 
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a forward or backward slash, / or \. Instances of | in left- and right-hand sides 
of a single production should be interpreted as representing slashes of the same 
direction. The symbols X, Y and Z are to be read as variables which match 
any category. 

Forward application: 
Backward application: 
Forward composition: 
Backward composition: 

A string of words is accepted by a CCG, if a specified nonterminal symbol 
(usually S) derives a string of categories that is an image of the string of words 
under the mapping /. 

A bottom-up algorithm — essentially the CYK algorithm instantiated for 
these productions — can be easily specified for CCGs. Given a CCG, and a 
string w = w\ ■ ■ ■ w n to be parsed, we will consider a logic with items of the 
form [X, i,j] where A is a category and i and j are integers ranging from to 
n. Such an item, asserts that the substring of the string w from the i + 1-th 
element up to the j-th element can be reduced to the category X. The required 
proof rules for this logic are given in Figure |^. 

To illustrate the operations, we will use the lexicon in Figure ^ to combine 
the string 

John really likes bananas (2) 
Among other ways, the sentence can be proved as follows: 

1 [NP, 0, 1] AXIOM 

2 [(S\NP)/(S\NP), 1,2} axiom 

3 [(S\NP)/NP,2,3\ axiom 

4 [(S\NP)/NP, 1,3] forward COMPOSITION from 2 and 3 

5 [NP, 3, 4] axiom 

6 [(S\NP), 1,4] forward APPLICATION from 4 and 5 

7 [S, 0,4] BACKWARD APPLICATION from 1 and 6 

Other extensions of CCG (such as generalized composition and coordination) 
can be easily implemented using such deduction parsing methods. 



X 
X 

x\z 
x\z 



X/Y Y 
Y X\Y 
X/Y Y\Z 
Y\Z X\Y 



4.3 Tree-Adjoining Grammars and Related Formalisms 



The formalism of tree-adjoining grammars (TAG) ( Joshi, Levy, and Takahashi, 



1975 ; Joshi, 1985 ) is a tree-generating system in which trees are combined by 
an operation of adjunction rather than the substitution operation of context- 
free grammars.^ The increased expressive power of adjunction allows important 

3 Most practical variants of TAG include both adjunction and substitution, but for purposes 
of exposition we restrict our attention to adjunction alone, since substitution is formally 
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Item form: 

Axioms: 

Goals: 

Inference rules: 

Forward Application 

Backward Application 

Forward Composition 1 

Forward Composition 2 

Backward Composition 1 

Backward Composition 2 



[X,i,i + 1] where X 6 f(w i+ i) 
[S,0,n] 



[X/Y,i,j] [Y,j,k] 
[X,i,j] 

[Y,i,j] [X\Y,j,k] 
[X,i,k] 

[X/Y,i,j] \Y/Z,j,k] 
[X/Z,i,k] 

[X/Y,t,j] \Y\Z,j,k] 
[X\Z,i,k] 

\Y/Z,i,j] [X\Y,j,k] 
[X/Z,i,k] 

\Y\Z,i,j] [X\Y,j,k] 
[X\Z,i,k] 



Figure 8: The CCG deductive parsing system. 
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VP 



NP 



VP 



VP* Adv 



NP 



Trip V 



rumbas 



VP 



(a) 



nimbly Trip VP Adv 



V nimbly 



rumbas 
(b) (c) 



Figure 9: An example tree-adjoining grammar consisting of one initial tree (a), 
and one auxiliary tree (b). These trees can be used to form the derived tree 
(c) for the sentence "Trip rumbas nimbly." (In an actual English grammar, the 
tree depicted in (a) would not be an elementary tree, but itself derived from 
two trees, one for each lexical item, by a substitution operation.) 



natural-language phenomena such as long-distance dependencies to be expressed 
locally in the grammar, that is, within the relevant lexical ent ries, rather than 
by many specialized context-free rules ( Kroch and Joshi, 198E| ). 

A tree-adjoining grammar consists of a set of elementary trees of two types: 
initial trees and auxiliary trees. An initial tree is complete in the sense that its 
frontier includes only terminal symbols. An example is given in Figure [|(a). An 
auxiliary tree is incomplete; it has a single node on the frontier, the foot node, 
labeled by the same nonterminal as the root. Figure ||(b) provides an example. 
(By convention, foot nodes are redundantly marked by a diacritic asterisk (*) 
as in the figure.) 

Although auxiliary trees do not themselves constitute complete grammatical 
structures, they participate in the construction of complete trees through the 
adjunction operation. Adjunction of an auxiliary tree into an initial tree is 
depicted in Figure [l^. The operation inserts a copy of an auxiliary tree into 
another tree in place of an interior node that has the same label as the root and 
foot nodes of the auxiliary tree. The subtree that was previously connected to 
the interior node is reconnected to the foot node of the copy of the auxiliary 
tree. For example, the auxiliary tree in Figure ^(b) can be adjoined at the VP 

dispensible and its implementation in parsing systems such as we describe is very much like the 
context-free operation. Similarly, we do not address other issues such as adjoining- constraint? 



and extended deriv ations. Discussion of those can be found elsewhere (Schabes, 1994 
|and Shicbcr, 1992 ). 



Schabes 
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initial tree auxiliary tree derived tree 




Figure 10: The operation of adjunction. The auxiliary tree is spliced into the 
initial tree to yield the derived tree at right. 

node of the initial tree in Figure ^|(a) to form the derived tree in Figure ||(c) . 
Adjunction in effect supports a form of string wrapping and is therefore more 
powerful than the substitution operation of context-free grammars. 

A tree- adjoining grammar can be specified as a quintuple G = (N, E, /, A, S), 
where N is the set of nonterminals including the start symbol S, E is the disjoint 
set of terminal symbols, I is the set of initial trees, and A is the set of auxiliary 
trees. 

To describe adjunction and TAG derivations, we need notation to refer to 
tree nodes, their labels, and the subtrees they define. Every node in a tree a can 
be specified by its address, a sequence of positive integers defined inductively 
as follows: the empty sequence e is the address of the root node, and p ■ k is 
the address of the fc-th child of the node at address p. Foot(a) is defined as 
the address of the foot node of the tree a if there is one; otherwise Foot(a) is 
undefined. 

We denote by a@p the node of a at address p, and by a/p the subtree of a 
rooted at a@p. The grammar symbol that labels node v is denoted by Label(v). 
Given an elementary tree node v, Adj(y) is defined as the set of auxiliary trees 
that can be adjoined at node 

4 For TAGs with no constraints on adjunction (for instance, as defined here), Adj(u) is just 
the set of elementary auxiliary trees whose root node is labeled by Label(u). When other 
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Finally, we denote by a[(3i t— * pi , . . . , /?& i— > pfe] the result of adjoining the 
trees . . . ,/3k at distinct addresses pi, . . . ,pu in the tree a. 

The set of trees D(G) derived by a TAG G can be defined inductively. D(G) 
is the smallest set of trees such that 

1. /UAC D(G), that is, all elementary trees are derivable, and 

2. Define D(a, G) to be the set of all trees derivable as a\f}\ i— ► pi, . . . , j3% i— > 
Pfe] where Pi, ...,0k S D(G) and Pi,.. - ,Pft are distinct addresses in a. 
Then, for all elementary trees ae/Ui, D(a,G) C D(G). Obviously, if 
a is an initial tree, the tree thus derived will have no foot node, and if a 
is an auxiliary tree, the derived tree will have a foot node. 

The valid derivations in a TAG are the trees in D(as, G) where as is an initial 
tree whose root is labeled with the start symbol S. 

Parsers for TAG can be described just as those for CFG, as deduction sys- 
tems. The parser we present here is a variant of the CYK algorithm extended 
for TAGs, similar, though not identical, to that of Vijay-Shanker ( |1987| ). We 
chose it for expository reasons: it is by far the simplest TAG parsing algorithm, 
in part because it is restricted to TAGs in which elementary trees are at most 
binary branching, but primarily because it is purely a bottom-up system; no 
prediction is performed. Despite its simplicity, the algorithm must handle the 
increased generative capacity of TAGs over that of context-free grammars. Con- 
sequently, the worst case complexity for the parser we present is worse than for 
CFGs — 0(n 6 ) time for a sentence of length n. 

The present algorithm uses a dotted tree to track the progress of parsing. A 
dotted tree is an elementary tree of the grammar with a dot adjacent to one of 
the nodes in the tree. The dot itself may be in one of two positions relative to the 
specified node: above or below. A dotted tree is thus specified as an elementary 
tree a, an address p in that tree, and a marker to specify the position of the dot 
relative to the node. We will use the notation v* and v, for dotted trees with 
the dot above and below node v, respectively.^ 

In order to track the portion of the string covered by the production up to 
the dot position, the CYK algorithm makes use of two indices. In a dotted tree, 
however, there is a further complication in that the elementary tree may contain 
a foot node so that the string covered by the elementary tree proper has a gap 
where the foot node occurs. Thus, in general, four indices must be maintained: 

adjoining constraints are allowed, as is standard, they can be incorporated through a revised 
definition of Adj. 

5 Although both this algorithm and Ear ley's use a dot in items to distinguish the progress 
of a parse, they are used in quite distinct ways. The dot of Earley's algorithm tracks the 
left-to-right progress of the parse among siblings. The dot of the CYK TAG parser tracks 
the pre-/post-adj unction s tatus of a singl e node. For this reason, when generalizing Earley's 



algorithm to TAG parsing (Schabes, 1994), four dot positions are used to simultaneously track 
pre-/post-adjunction and before/after node left-to-right progress. 
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two (i and I in Figure |T^) to specify the left edge of the auxiliary tree and the 
right edge of the parsed portion (up to the dot position) of the auxiliary tree, 
and two more (J and k) to specify the substring dominated by the foot node. 

The parser therefore consists of inference rules over items of the following 
forms: [v',i,j,k,l] and [v m ,i,j,k,l], where 

• v is a node in an elementary tree, 

• i, j,k,l are indices of positions in the input string wi • ■ ■ w n ranging over 
{0, • ■ • , n} U {_}, where _ indicates that the corresponding index is not used 
in that particular item. 

An item of the form [a@p',i, _, _, I] specifies that there is a tree T G D(a/p, G), 
with no foot node, such that the fringe of T is the string Wi+i ■ ■ ■ tuj. An item of 
the form [a@p',i, j, k, I] specifies that there is a tree T G D(a/p, G), with a foot 
node, such that the fringe of T is the string Wi+i ■ ■ ■ Wj Label(Foot(T)) Wk+i ■ • - Wi- 
The invariants for [a@p,,i, _, _, I] and [a@p,, k, 1} are similar, except that the 
derivation of T must not involve adjunction at node a@p. 

The algorithm preserves this invariant while traversing the derived tree from 
bottom to top, starting with items corresponding to the string symbols them- 
selves, which follow from the axioms 

[v* , i, _, _, i + 1] Label{v) — w i+ \ 

combining completed subtrees into larger ones, and combining subtrees before 
adjunction (with dot below) and derived auxiliary trees to form subtrees after 
adjunction (with dot above). Figure [H] depicts the movement of the dot from 
bottom to top as parsing proceeds. In Figure |ll](a), the basic rules of dot 
movement not involving adjunction are shown, including the axiom for terminal 
symbols, the combination of two subchildren of a binary tree and one child of a 
unary subtree, and the movement corresponding to the absence of an adjunction 
at a node. These are exactly the rules that would be used in parsing within a 
single elementary tree. Figure pd|(b) displays the rules involved in parsing an 
adjunction of one tree into another. 

These dot movement rules are exactly the inference rules of the TAG CYK 
deductive parsing system, presented in full in Figure ^|. In order to reduce the 
number of cases, we define the notation i U j for two indices i and j as follows: 



Although this parser works in time 0(n 6 ) — the Adjoin rule with its six 
independent indices is the step that accounts for this complexity — and its 
average behavior may be better, it is in practice too inefficient for practical 




i i = j 

undefined otherwise 



J 
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D 




Item form: 

Axioms: 

Terminal Axiom 

Empty String Axiom 

Foot Axiom 

Goals: 

Inference Rules: 
Complete Unary 

Complete Binary 

No Adjoin 

Adjoin 



v',i,j,k,l] 
v.,i,j,k,l] 



+ Label{i>) = Wi + \ 

v*,i,-,-,i] Label(v) = e 
P@Foot{p).,p,p,q,q] 

a@e*, 0, _, _, n] a e I and Label(a@e) = S 



a@{p-l)',i,j,k,l] 



a@(p ■ 2) undefined 



[a@p.,i,j,k,l] 
q@(p-l)V,j,M] [a@(p-2);i,j',k',m] 



[a@p.,i,jl)j', k U k',m] 
v.,i,j,k,l] 



y*,i,j,k,l] 

[P@e*,i,p,q,l] [v.,p,j,k,q] 
[v*,i,j,k,l] 



/3 e Adj{u) 



Figure 12: The CYK deductive parsing system for tree-adjoining grammars. 
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use for two reasons. First, an attempt is made to parse all auxiliary trees 
starting bottom-up from the foot node, regardless of whether the substring 
between the foot indices actually can be parsed in an appropriate manner. This 
problem can be alleviated, as suggested by Vijay-Shanker and Weir ( 1993| ), by 



replacing the Foot Axiom with a Complete Foot rule that generates the item 
[/3@Foot(f3) m ,p,p,q,q] only if there is an item [v m) p, j, k, q] where j3 G Adj(y), 
i.e., 

Complete Foot r ^^f'/'*^ r P 6 Adj{v) 

[[3@Foot{f3).,p,p,q,q] ' 

This complicates the invariant considerably, but makes auxiliary tree parsing 
much more goal-directed. Second, because of the lack of top-down prediction, 
attempts are made to parse elementary trees that are not consistent with the 
left context. Predictive parsers for TAG can be, and have been, described as 



deductive systems. For instance, Schabes ( 1994 ) provides a detailed explanation 
for a predictive left-to-right parser for TAG inspired by the techniques of Earley's 
algorithm. Its worst-case complexity is 0(n e ) as well, but its average complexity 
on English grammar is well superior to its worst case, and also to the CYK TAG 
parser. A parsing system based on this algorithm is currently being used in the 
development of a large English tree-adjoining grammar at the University of 



Pennsylvania (Paroubek, Schabes, and Joshi, 1992) 



Many other formalisms related to tree-adjoining grammars have been pro- 
posed, and the deductive parsing approach is applicable to these as well. For 
instance, as part of an investigation of the precise definition of TAG derivation, 
Schabes and Shieber describe a compilation of tree-adjoining grammars to lin- 
ear indexed grammars, together with an efficient algorithm, stated as deduction 



system for recognition and parsing according to the compiled grammar ( Sch- 
abes and Shieber, 1992| ) . A prototype of this parser has been implemented using 
the deduction engine described here. (In fact, it was as an aid to testing this 
algorithm, with its eight inference rules each with as many as three antecedent 
items, that the deductive parsing meta-interpreter was first built.) 



Schabes and Waters ( 1993b| ) suggest the use of a restricted form of TAG in 



which the foot node of an auxiliary tree can occur only at the left or right edge 
of the tree. Since the portion of string dominated by an auxiliary tree is con- 
tiguous under this constraint, only two indices are required to track the parsing 
of an auxiliary tree adjunction. Consequently, the formalism can generate only 
context-free languages and can be parsed in cubic time. The resulting system, 
called lexicalized context-free grammar (LCFG), is a compromise between the 
parsing efficiency of context-free grammar and the elegance and lexical sensitiv- 
ity of tree-adjoining grammar. The deductive parsing meta-interpreter has also 



been used for rapid prototyping of an Earley-style parser for LCFG ( [Schabes 



and Waters, 1993a| ) 
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4.4 Inadequacy for Sequent Calculi 

All the parsing logics discussed here have been presented in a natural-deduction 
format that can be implemented directly by bottom-up execution. However, 



important parsing logics, in particular the Lambek calculus (Lambek, 1958 
Moortgat, 1988] ), are better presented in a sequent-calculus format. The main 



reason for this is that those systems use nonatomic formulas that represent con- 
current or hypothetical analyses. For instance, if for arbitrary u with category 
B we conclude that vu has category A, then in the Lambek calculus we can 
conclude that v has category A/B. 

The main difficulty with applying our techniques to sequent systems is that 
computationally they are designed to be used in a top-down direction. For 
instance, the rule used for the hypothetical analysis above has the form: 

r h a/b [6) 

It is reasonable to use this rule in a goal-directed fashion (consequent to an- 
tecedent) to show r h A/B, but using it in a forward direction is impractical, 
because B must be arbitrarily assumed before knowing whether the rule is ap- 
plicable. 

More generally, in sequent formulations of syntactic calculi the goal sequent 
for showing the grammaticality of a string Wi has the form 

Wi ■ ■ ■ W n h s 

where Wi gives the grammatical category of Wi and S is the category of a sen- 
tence. Proof search proceeds by matching current sequents to the consequents 
of rules and trying to prove the corresponding antecedents, or by recognizing 
a sequent as an axiom instance A h A. The corresponding natural deduc- 
tion proof would start from the assumptions W\, . . . , W n and try to prove S, 
which is just the proof format that we have used here. However, sequent rules 
like (|3|) above correspond to the introduction of an additional assumption (not 
one of the Wi) at some point in the proof and its later discharge, as in the 
natural-deduction detachment rule for propositional logic. But such undirected 
introduction of assumptions just in case they may yield consequences that will 
be needed later is computationally very costly.^] Systems that make full use of 
the sequent formulation therefore seem to require top-down proof search. It is 
of course possible to encode top-down search in a bottom-up system by using 
more complex encodings of search state, as is done in Earley's algorithm or in 
t he magic sets/magic templates com pilation method for deductive databases 



(Bancilhon and Ramakrishnan, 1988; 



Ramakrishnan, 1988| ). Pentus ( |1993[) , for 



6 There is more than a passing similarity between this problem and the problem of pure 
bottom-up parsing with grammars with gaps. In fact, a nat ural logical formulation o f gaps is 



is ass umptions discharged by the wh-phrase they stand for (Pareschi and Miller, 1990: Hodas 



1992). 



26 



instance, presents a compilation of Lambek calculus to a CFG, which can then 
be processed by any of the standard methods. However, it is not clear yet that 
such techniques can be applied effectively to grammatical sequent calculi so that 
they can be implemented by the method described here. 



5 Control and Implementation 

The specification of inference rules, as carried out in the previous two sections, 
only partially characterizes a parsing algorithm, in that it provides for what 
items are to be computed, but not in what order. This further control informa- 
tion is provided by choosing a deduction procedure to operate over the inference 
rules. If the deduction procedure is complete, it actually makes little difference 
in what order the items are enumerated, with one crucial exception: We do not 
want to enumerate an item more than once. To prevent this possibility, it is 
standard to maintain a cache of lemmas, adding to the cache only those items 
that have not been seen so far. The cache plays the same role as the chart in 



chart-parsing algorithms (Kay, 1986), the well-formed substring table in CYK 



parsing (Kasami, 1965; Younger, 1967), and the state sets in Earley's algorithm 
(Earley, 1970). In this section, we develop a forward-chaining deduction proce- 
dure that achieves this elimination of redundancy by keeping a chart. 

Items should be added to the chart as they are proved. However, each new 
item may itself generate new consequences. The issue as to when to compute 
the consequences of a new item is quite subtle. A standard solution is to keep 
a separate agenda of items that have been proved but whose consequences have 
not been computed. When an item is removed from the agenda and added to 
the chart, its consequences are computed and themselves added to the agenda 
for later consideration. 

Thus, the general form of an agenda-driven, chart-based deduction procedure 
is as follows: 

1. Initialize the chart to the empty set of items and the agenda to the axioms 
of the deduction system. 

2. Repeat the following steps until the agenda is exhausted: 

(a) Select an item from the agenda, called the trigger item, and remove 
it. 

(b) Add the trigger item to the chart, if necessary. 

(c) If the trigger item was added to the chart, generate all items that 
are new immediate consequences of the trigger item together with all 
items in the chart, and add these generated items to the agenda. 

3. If a goal item is in the chart, the goal is proved (and the string recognized); 
otherwise it is not. 
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There are several issues that must be determined in making this general 
procedure concrete, which we describe under the general topics of eliminating 
redundancy and providing efficient access. At this point, however, we will show 
that, under reasonable assumptions, the general procedure is sound and com- 
plete. 

In the arguments that follow, we will assume that items are always ground 
and thus derivations are as defined in Section |[ A proof for the more general 
case, in which items denote sets of possible grammaticality judgments, would 
require more intricate definitions for items and inference rules, without changing 
the essence of the argument. 



Soundness We need to show that if the above procedure places item I in the 
chart when the agenda has been initialized in step (1) with items A\, . . . , Ak, 
then Ai,...,Ak h /. Since any item in the chart must have been in the 
agenda, and been placed in the chart by step (2b), it is sufficient to show that 
At, . . . , Ak h I for any I in the agenda. We show this by induction on the stage 
jj(J) of /, the number of the iteration of step (2) at which / has been added to 
the agenda, or if I has been placed in the agenda at step (1). Note that since 
several items may be added to the agenda on any given iteration, many items 
may have the same stage number. 

If jj(J) = 0, / must be an axiom, and thus the trivial derivation consisting 
of I alone is a derivation of I from A\, . . . , 

Assume that A\, . . . , Ak \~ J for jj(J) < n and that = n. Then / must 
have been added to the agenda by step (2c), and thus there are items Ji, . . . , J m 
in the chart and a rule instance such that 

— j — (side conditions on J\, . . . , J m , I) 

where the side conditions are satisfied. Since Ji, . . . , J m are in the chart, they 
must have been added to the agenda at the latest at the beginning of iteration 
n of step (2), that is, tt(Ji) < n. By the induction hypothesis, each Jj must 
have a derivation from Ax, . . . , Ak- But then, by definition of derivation, the 
concatenation of the derivations Ai, . . . , A m followed by / is a derivation of / 
from Ax, . .., A k . 



Completeness We want to show that if A\,...,Ak h /, then / is in the 
chart at step (3). Actually, we can prove something stronger, namely that / 
is eventually added to the chart, if we assume some form of fairness for the 
agenda. Then we will have covered cases in which the full iteration of step (2) 
does not terminate but step (3) can be interleaved with step (2) to recognize 
the goal as soon as it is generated. The form of fairness we will assume is that 
if (j(7) < j}( J) then item I is removed from the agenda by step (2a) before item 
J. The agenda mechanism described in Section 5.3 below satisfies the fairness 
assumption. 
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We show completeness by induction on the length of any derivation Di , . . . , D r , 
of I from Ai, . . . ,Ak- (Thus we show implicitly that the procedure generates 
every derivation, although in general it may share steps among derivations.) 

For n = 1, I = Dx = Ai for some i. It will thus be placed in the agenda at 
step (1), that is = 0. Thus by the fairness assumption / will be removed 
from the agenda in at most k iterations of step (2). When it is, it is either 
added to the chart as required, or the chart already contains the same item. 



(See discussion of the "if necessary" proviso of step (2b) in Section below.) 

Assume now that the result holds for derivations of length less than n. Con- 
sider a derivation D\, . . . , D n = I. Either / is an axiom, in which case we have 
just shown it will have been placed in the chart by iteration k, or, by definition 
of derivation, there are ii, . . . ,i m < n such that there is a rule instance 

(side conditions on , . . . D lm , /) (4) 

with side conditions satisfied. By definition of derivation, each prefix D\, . . . , Dj, 
of D\ , . . . , D n is a derivation of Di j from A\ , . . . , Ak- Then each Di . is in the 
chart, by the induction hypothesis. Therefore, for each Di j there must have been 
an identical item Ij in the agenda that was added to the chart at step (2b). Let 
I p be the item in question that was the last to be added to the chart. Immedi- 
ately after that addition, all of the Ij (that is, all of the Di,) are in the chart, 
and I p = Di p is the trigger item for rule application (jij) . Thus / is placed in the 
agenda. Since step (2c) can only add a finite number of items to the agenda, by 
the fairness assumption item / will eventually be considered at steps (2a) and 
(2b), and added to the chart if not already there. 



5.1 Eliminating Redundancy 

Redundancy in the chart. The deduction procedure requires the ability to 
generate new consequences of the trigger item and the items in the chart. The 
key word in this requirement is "new" . Indeed, the entire point of a chart-based 
system is to allow caching of proved lemmas so that previously proved (old) 
lemmas are not further pursued. It is therefore crucial that no item be added to 
the chart that already exists in the chart, and it is for this reason that step (2b) 
above specifies addition to the chart only "if necessary" . 



Definition of "redundant item". The point of the chart is to serve as a 
cache of previously proved items, so that an item already proved is not pursued. 
What does it mean for an item to be redundant, that is, occurring already in 
the agenda or chart? In the case of ground items, the appropriate notion of 
occurrence in the chart is the existence of an identical chart item. If items can 
be non-ground (for instance, when parsing relative to definite-clause grammars 
rather than context-free grammars) a more subtle notion of occurrence in the 
chart is necessary. As mentioned above, a non-ground item stands for all of 
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its ground instances, so that a non-ground item occurs in the chart if all its 
ground instances are covered by chart items, that is, if it is a specialization of 
some chart item. (This test suffices because of the strong compactness of sets 
of terms defined by equations: if the instances of a term A are a subset of the 
union of the instances of B and C . then the instances of A must be a subset 



of the instances of cither B or C ( Lasscz, Maher, and Marriot, 1988 ).) Thus 



the appropriate test is whether an item in the chart subsumes the item to be 
added.Q 



Redundancy in the agenda. We pointed out that redundancy checking in 
the chart is necessary. The issue of redundancy in the agenda is, however, a 
distinct one. Should an item be added to the agenda that already exists there? 

Finding the rule that matches a trigger item, triggering the generation of new 
immediate consequences, and checking that consequences are new are expensive 
operations to perform. The existence of duplicate items in the agenda therefore 
generates a spurious overhead of computation especially in pathological cases 
where exponentially many duplicate items can be created in the agenda, each 
one creating an avalanche of spurious overhead. 

For these reasons, it is also important to check for redundancy in the agenda, 
that is, the notion of "new immediate consequences" in step (2c) should be 
interpreted as consequent items that do not already occur in the chart or agenda. 
If redundancy checking occurs at the point items are about to be added to the 
agenda, it is not required when they are about to be added to the chart; the "if 
necessary" condition in step (2b) will in this case be vacuous, since always true. 



Triggering the generation of new immediate consequences. With re- 
gard to step (2c), in which we generate "all items that are new immediate 
consequences of the trigger item together with all other items in the chart" , we 
would like, if at all possible, to refrain from generating redundant items, rather 
than generating, checking for, and disposing of the redundant ones. Clearly, 
any item that is an immediate consequence of the other chart items only (that 
is, without the trigger item) is not a new consequence of the full chart. (It 
would have been generated when the last of the antecedents was itself added to 
the chart.) Thus, the inference rules generating new consequences must have 
at least one of their antecedent items being the trigger item, and the search for 
new immediate consequences can be limited to just those in which at least one 
of the antecedents is the trigger item. The search can therefore be carried out 
by looking at all antecedent items of all inference rules that match the trigger 
item, and for each, checking that the other antecedent items are in the chart. 
If so, the consequent of that rule is generated as a potential new immediate 

7 This subsumption check can be implemented in several ways in Prolog. The code in the 
appendix presents two of the options. 
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consequence of the trigger items plus other chart items. (Of course, it must be 
checked for prior existence in the agenda and chart as outlined above.) 

5.2 Providing Efficient Access 

Items should be stored in the agenda and chart in such a way that they can 
be efficiently accessed. Stored items are accessed at two points: when checking 
a new item for redundancy and when checking a (non-trigger) antecedent item 
for existence in the chart. For efficient access, it is desirable to be able to 
directly index into the stored items appropriately, but appropriate indexing 
may be different for the two access paths. We discuss the two types of indexing 
separately, and then turn to the issue of variable renaming. 

Indexing for redundancy checking. Consider, for instance, the Earley de- 
duction system. All items that potentially subsume an item [i,A — ► a • /3,j] 
have a whole set of attributes in common with the item, for instance, the indices 
i and j, the production from which the item was constructed, and the position 
of the dot (i.e., the length of a). Any or all of these might be appropriate for 
indexing into the set of stored items. 

Indexing for antecedent lookup. The information available for indexing 
when looking items up as potential matches for antecedents can be quite differ- 
ent. In looking up items that match the second antecedent of the completion 
rule [k,B — > 7 • , j], as triggered by an item of the form [i,A — ► a • B/3,k], 
the index k will be known, but j will not be. Similarly, information about B 
will be available from the trigger item, but no information about 7. Thus, an 
appropriate index for the second antecedent of the completion rule might in- 
clude its first index k and the main functor of the left-hand-side B. For the 
first antecedent item, a similar argument calls for indexing by its second index 
k and the main functor of the nonterminal B following the dot. The two cases 
can be distinguished by the sequence after the dot: empty in the former case, 
non-empty in the latter. 

Variable renaming. A final consideration in access is the renaming of vari- 
ables. As non-ground items stored in the chart or agenda are matched against 
inference rules, they become further instantiated. This instantiation should not 
affect the items as they are stored and used in proving other consequences, so 
that care must be taken to ensure that variables in agenda and chart items are 
renamed consistently before they are used. Prolog provides various techniques 
for achieving this renaming implicitly. 
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5.3 Prolog Implementation of Deductive Parsing 



In light of the considerations presented above, we turn now to our method of 
implementing an agenda-based deduction engine in Prolog. We take advantage 
of certain features that have become standard in Prolog implementations, such 
as clause indexing. The code described below is consistent with Quintus Prolog. 



5.3.1 Implementation of Agenda and Chart 

Since redundancy checking is to be done in both agenda and chart, we need the 
entire set of items in both agenda and chart to be stored together. For efficient 
access, we store them in the Prolog database under the predicate stored/2. 
The agenda and chart are therefore comprised of a series of unit clauses, e.g., 

storedd, item( . . . ) ) . < — beginning of chart 
stored(2, item( . . . ) ) . 
stored(3, item( . . . ) ) . 

stored (i— 1, item (...)). < — end of chart 

storedG, item(...)). < — head of agenda 
stored(i+l, item(...)). 

stored(fc— 1, item(...)). 

stored(fc, item(...)). < — tail of agenda 

The first argument of stored/2 is a unique identifying index that corresponds 
to the position of the item in the storage sequence of chart and agenda items. 
(This information is redundantly provided by the clause ordering as well, for 
reasons that will become clear shortly.) The index therefore allows (through 
Quintus's indexing of the clauses for a predicate by their first head argument) 
direct access to any stored item. 

Since items are added to the sequence at the end, all items in the chart 
precede all items in the agenda. The agenda items can therefore be characterized 
by two indices, corresponding to the first (head) and last (tail) items in the 
agenda. A data structure packaging these two "pointers" therefore serves as the 
proxy for the agenda in the code. An item is moved from the agenda to the 
chart merely by incrementing the head pointer. Items arc added to the agenda 
by storing the corresponding item in the database and incrementing the tail 
pointer. 

To provide efficient access to the stored items, auxiliary indexing tables can 
be maintained. Each such indexing table, is implemented as a set of unit clauses 
that map access keys to the indexes of items that match them. In the present 
implementation, a single indexing table (under the predicate key_index/2) is 
maintained that is used for accessing items both for redundancy checking and for 
antecedent lookup. (This is possible because only the item attributes available 



32 



in both types of access are made use of in the keys, leading to less than optimal 
indexing for redundancy checking, but use of multiple indexing tables leads to 
much more database manipulation, which is quite costly.) 

In looking up items for redundancy checking, all stored items should be 
considered, but for antecedent lookup, only chart items are pertinent. The dis- 
tinction between agenda and chart items is, under this implementation, implicit. 
The chart items are those whose index is less than the head index of the agenda. 
This test must be made whenever chart items are looked up. However, since 
clauses are stored sequentially by index, as soon as an item is found that fails 
the test (that is, is in the agenda) the search for other chart items can be cut 
off. 

5.3.2 Implementation of the Deduction Engine 

Given the design decisions described above, the general agenda-driven, chart- 
based deduction procedure presented in Section || can be implemented in Prolog 
as follows :0 

parse (Value) :- 

% (1) Initialize the chart and agenda 
init_chart , 
init_agenda(Agenda) , 

% (2) Remove items from the agenda and process 
% until the agenda is empty 
exhaust (Agenda) , 

% (3) Try to find a goal item in the chart 
goal_item_in_chart (Goal) . 

To exhaust the agenda, trigger items are repeatedly processed until the agenda 
is empty: 

exhaust (Empty) : - 

% (2) If the agenda is empty, we're done 
is_empty_agenda (Empty) . 

exhaust (AgendaO) :- 

% (2a) Otherwise get the next item index from the agenda 

pop_agenda(AgendaO, Index, Agendal) , 

% (2b) Add it to the chart 

add_item_to_chart (Index) , 

% (2c) Add its consequences to the agenda 

add_consequences_to_agenda(Index, Agendal, Agenda), 

8 The code presented here diverges slightly from that presented in the appendix for reasons 
of exposition. 
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% (2) Continue processing the agenda until empty 
exhaust (Agenda) . 

For each item, all consequences are generated and added to the agenda: 

add_consequences_to_agenda(Index, AgendaO, Agenda) :- 
f indall (Consequence , 

consequence (Index, Consequence), 

Consequences) , 
add_items_to_agenda (Consequences, AgendaO, Agenda). 

The predicate add_items_to_agenda/3 adds the items under appropriate in- 
dices as stored items and updates the head and tail indices in AgendaO to form 
the new agenda Agenda. 

A trigger item has a consequence if it matches an antecedent of some rule, 
perhaps with some other antecedent items and side conditions, and the other 
antecedent items have been previously proved (thus in the chart) and the side 
conditions hold: 

consequence (Index, Consequent) :- 
index_to_item(Index, Trigger), 
mat ching_rule (Trigger , 

RuleName, Others, Consequent, SideConds) , 
items_in_chart (Others, Index), 
hold(SideConds) . 

Note that the indices of items, rather than the items themselves are stored in the 
agenda, so that the index of the trigger item must first be mapped to the actual 
item (with index_to_item/2) before matching it against a rule antecedent. The 
items_in_chart/2 predicate needs to know both the items to look for (Others) 
and the index of the current item (index) as the latter distinguishes the items 
in the chart (before this index) from those in the agenda (after this index). 

We assume that the inference rules are stored as unit clauses under the pred- 
icate inference (RuleName , Antecedents, Consequent, SideConds) where 
RuleName is some mnemonic name for the rule (such as predict or scan), 
Antecedents is a list of the antecedent items of the rule, Consequent is the 
single consequent item, and Sideconds is a list of encoded Prolog literals to 
execute as side conditions. To match a trigger item against an antecedent of 
an inference rule, then, we merely select a rule encoded in this manner, and 
split up the antecedents into one that matches the trigger and the remaining 
unmatched antecedents (to be checked for in the chart). 

matching_rule (Trigger, 

RuleName, Others, Consequent, SideConds) :- 
inference (RuleName, Antecedents, Consequent, SideConds), 
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split (Trigger , Antecedents, Others) 



5.3.3 Implementation of Other Aspects 



A full implementation of the deduction-parsing system — complete with en- 
codings of several deduction systems and sample grammars — is provided in 
the appendix. The code in the appendix covers the following aspects of the 
implementation that are not elsewhere described. 



1. Input and encoding of the string to be parsed (Section A.l). 

2. Implementation of the deduction engine driver including generation of 
consequences (Section A.2). 



Encoding of the storage of items (Section [A. 3] ) including the implementa- 
tion of the chart (Section |A.3.1 ) and agenda (Section A. 3. 2). 



Encoding of deduction systems (Section A.4). 
Implementation of subsumption checking (Section 



All Prolog code is given in the Edinburgh notation, and has been tested under 
the Quintus Prolog system. 



5.4 Alternative Implementations 

This implementation of agenda and chart provides a compromise in terms of 
efficiency, simplicity, and generality. Other possibilities will occur to the reader 
that may have advantages under certain conditions. Some of the alternatives 
are described in this section. 



Separate agenda and chart in database. Storage of the agenda and the 
chart under separate predicates in the Prolog database allows for marginally 
more efficient lookup of chart items; an extraneous arithmetic comparison of 
indices is eliminated. However, this method requires an extra retraction and 
assertion when moving an index from agenda to chart, and makes redundancy 
checking more complex in that two separate searches must be engaged in. 



Passing agenda as argument. Rather than storing the agenda in the database, 
the list of agenda items might be passed as an argument. (The implementation 
of queues in Prolog is straightforward, and would be the natural structure to 
use for the agenda argument.) This method again has the marginal advantage 
in antecedent lookup, but it becomes almost impossible to perform efficient 
redundancy checking relative to items in the agenda. 
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Efficient bottom-up interpretation. The algorithm just presented can be 
thought of as a pure bottom-up evaluator for inference rules given as definite 
clauses, where the head of the clause is the consequent of the rule and the body 
is the antecedent. However, given appropriate inference rules, the bottom-up 
procedure will simulate non-bottom-up parsing strategies, such as the top-down 
and Earley strategies described in Section ||. Researchers in deductive databases 
have extensively investigated variants of that idea: how to take advantage of 
the tabulation of results in the pure bottom-up procedure while keeping track 
of goal-directed constraints on possible answers. As part of these investigations, 
efficient bottom-up evaluators for logic programs have been designed, for in- 



stance CORAL ( Ramakrishnan, Srivastava, and Sudarshan, 1992). Clearly, one 



could use such a system directly as a deduction parser. 

Construction of derivations. The direct use of the inference rules for build- 



ing derivations, as presented in Section 4.1, is computationally inefficient, since 



it eliminates structure-sharing in the chart. All ways of deriving the same string 
will yield distinct items, so that sharing of computation of subderivations is no 
longer possible. 

A preferable method is to compute the derivations offline by traversing the 
chart after parsing is finished. The deduction engine can be easily modified 
to do so, using a technique reminiscent of that used in the Core Language 
Engine ( Alshawi, 1992| ). First, we make use of two versions of each inference 



rule, an online version such as the Earley system given in Figure ||, with no 
computation of derivations, and an offline version like the one in Figure ^| that 
does generate derivation information. We will presume that these two ver- 
sions are stored, respectively, under the predicates inference/4 (as before) and 
inf erence_of f line/4, with the names of the rules specifying the correspon- 
dence between related rules. Similarly, the online initial_item/l specification 
should have a corresponding initial_item_of f line/1 version. 

The deduction engine parses a string using the online version of the rules, 
but also stores, along with the chart, information about the ways in which each 
chart item was constructed, with unit clauses of the form 

stored_history (Consequent , Rule, Antecedents). , 

which specifies that the item whose index is given by Consequent was generated 
by the inference rule whose name is Rule from the antecedent items given in the 
sequence Antecedents. (If an item is generated as an initial item, its history 
would mark the fact by a unit clause using the constant initial for the Rule 
argument.) 

When parsing has completed, a separate process is applied to each goal item, 
which traverses these stored histories using the second (offline) version of the in- 
ference rules rather than the first, building derivation information in the process. 
The following Prolog code serves the purpose. It defines of f line_item(Index , 
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Item), a predicate that computes the offline item Item (presumably including 
derivation information) corresponding to the online item with index given by 
Index, using the second version of the inference rules, by following the deriva- 
tions stored in the chart history. 

of f line_item(Index, Item) :- 

stored_history(Index, initial, _NoAntecedents) , 
initial_item_off line (Item) . 

of f line_item(Index, Item) :- 

stored_history (Index, Rule, Antecedents), 

of f line_items (Antecedents , Antecedentltems) 

inf erence_of f line (Rule , Antecedentltems, Item, SideConds) , 

hold(SideConds) . 

of f line_items ( [] , [] ) . 

of fline_items ( [Index I Indexes] , [Item I Items] ) :- 
of f line_item(Index, Item) , 
of fline_items (Indexes , Items). 

The offline version of the inference rules need not merely compute a deriva- 
tion. It might perform some other computation dependent on derivation, such 
as semantic interpretation. Abstractly, this technique allows for staging the 
parsing into two phases, the second comprising a more fine-grained version of 
the first. Any staged processing of this sort can be implemented using this 
technique. 



Finer control of execution order For certain applications, it may be neces- 
sary to obtain even finer control over the order in which the antecedent items and 
side conditions are checked when an inference rule is triggered. Given that the 
predicates items_in_chart/2 and holds/ 1 perform a simple left-to-right check- 
ing of the items and side conditions, the implementation of matching_rule/5 
above leads to the remaining antecedent items and side conditions being checked 
in left-to-right order as they appear in the encoded inference rules, and the side 
conditions being checked after the antecedent items. However, it may be prefer- 
able to check antecedents and side conditions interleaved, and in different orders 
depending on which antecedent triggered the rule. 

For instance, the side condition A = A' in the second inference rule of 
Section 11 must be handled before checking for the nontrigger antecedent of that 
rule, in order to minimize nondeterminism. If the first antecedent is the trigger, 
we want to check the side conditions and then look for the second antecedent, 
and correspondingly for triggering the second antecedent. The implementation 
above disallows this possibility, as side conditions are always handled after the 
antecedent items. Merely swapping the order of handling side conditions and 
antecedent items, although perhaps sufficient for this example, does not provide 
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a general solution to this problem. 

Various alternatives are possible to implement a finer level of control. We 
present an especially brutish solution here, although more elegant solutions are 
possible. Rather than encoding an inference rule as a single unit clause, we 
encode it with one clause per trigger element under the predicate 

inference (RuleName , Antecedents, Consequent) 

where Rulename and Consequent are as before, but Antecedents is now a list 
of all the antecedent items and side conditions of the rule, with the trigger item 
first. (To distinguish antecedent items from side conditions, a disambiguating 
prefix operator can be used, e.g., @item( . . . ) versus ?side_condition( ...).) 
Matching an item against a rule then proceeds by looking for the item as the 
first element of this antecedent list. 

matching_rule (Trigger , RuleName, Others, Consequent) :- 
inference (RuleName, [Trigger I Others] , Consequent), 

The consequence/2 predicate is modified to use this new matching_rule/4 
predicate, and to check that all of the antecedent items and side conditions 
hold. 

consequence (Index, Consequent) :- 
index_to_item(Index, Trigger), 

mat ching_rule (Trigger, RuleName, Others, Consequent), 
hold(Others, Index). 

The antecedent items and side conditions are then checked in the order in 
which they occur in the encoding of the inference rule. 

hold([] , _Index) . 

hold ( [Antecedent I Antecedents] , Index) :- 
holds (Antecedent , Index), 
hold (Antecedents , Index). 

holds (Oltem, Index) :- item_in_chart (Item, Index), 

holds (?SideCond, _Index) :- call (SideCond) . 



6 Conclusion 

The view of parsing as deduction presented in this paper, which generalizes that 
of previous work in the area, makes possible a simple method of describing a 
variety of parsing algorithms — top-down, bottom-up, and mixed — in a way 
that highlights the relationships among them and abstracts away from incidental 
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differences of control. The method generalizes easily to parsers for augmented 
phrase structure formalisms, such as definite-clause grammars and other logic 
grammar formalisms. Although the deduction systems do not specify detailed 
control structure, the control information needed to turn them into full-fledged 
parsers is uniform, and can therefore be given by a single deduction engine that 
performs sound and complete bottom-up interpretation of the rules of inference. 
The implemented deduction engine that we described has proved useful for rapid 
prototyping of parsing algorithms for a variety of formalisms including variants 
of tree-adjoining grammars, categorial grammars, and lexicalized context-free 
grammars. 
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A Full Code for the Deductive Parsing Engine 



(Listing of file infer.pl) 

/*========================================================== 

Parser Based on a General Tabular Inference Engine 
==========================================================*/ 

/* 

LIBRARIES 

*/ 

:- use_module (library (readin) ) . '/, provides: read_in/l 

:- use_module(library(lists) ) . °/, provides: append/3 

'/, reverse/2 

/* 

COMPONENTS 

*/ 

:- ensure_loaded ( input ) . 
:- ensure_loaded (driver) . 
:- ensure_loaded(items) . 
:- ensure_loaded(inf erence) . 
:- ensure_loaded(grammars) . 
:- ensure_loaded(utilities) . 
:- ensure_loaded (monitor) . 

(End listing of file infer.pl) 



A.l Reading and Encoding of Input 

It is standard in the logic grammar literature to use a list encoding of strings 
and string positions. A string is taken to be a list of words, with string positions 
encoded as the suffix of that list beginning at the index in question. Thus, po- 
sition 3 of the string encoded [terry, writes, a, program, that, halts] 
would be encoded by the list [a, program, that, halts]. Items, which in- 
clude one or more such string positions, can become quite large, and testing 
for identity of string positions cumbersome, especially as these items are to be 
stored directly in the Prolog database. For this reason, we use a more direct 
encoding of string positions, and a commensurately more complicated encoding 
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of the underlying strings. String positions will be taken to be integers. A string 
will then be encoded as a series of unit clauses (using the predicate word/2 spec- 
ifying which words occur before which string positions. For instance, the string 
"Terry writes a program that halts" would be specified with the unit clauses 

word(l , terry) . 
word(2, writes), 
word (3, a) . 
word (4, program) . 
word(5, that) . 
word(6, halts) . 

The end of the string is no longer explicitly represented in this encoding, so 
that a predicate sentencelength/1 will be used to specify this information. 

sentencelength(6) . 

A predicate to read in an input string can perform the conversion to this 
encoded form automatically, asserting the appropriate unit clauses as the string 
is read in. 



(Listing of file input.pl) 

/* 

READING SENTENCES AND PREPARING PARSER INPUT 
*/ 

7.7.7. sentencelength(L) 

va ================= 

7.7.7. 

7,7,7, L is the length of the sentence being parsed. 

:- dynamic sentencelength/1. 

7.7.7. word (I, W) 
7.7.7. ========== 

7.7.7. 

7.7.7. W is the Ith word in the sentence being parsed. 
:- dynamic word/2. 

7.7.7. read_input 
7.7.7. ========== 
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7.7.7. 

7.7.7. Read a sentence from the user and assert its words 
7.7.7. and length. 

read_input :- 
read_in(S) , 
encode_sentence (S) . 



7.7.7. encode_sentence (+Sentence) 
7.7.7. ========================= 

7.7.7. 

7.7.7. Clear input, store and encode input Sentence. 

encode_sentence (Sentence) :- 
retractall(word(_,_)) , 
retractall(sentencelength(_)) , 
encode_words (Sentence, 0, Length), 
assert (sentencelength(Length) ) . 



7.7.7. encode_words(+Words, +P0, -P) 
7.7.7. ============================= 

7.7.7. 

7.7.7. Store input Words from position P0 + 1 to P. 

encode_words (['.'] , Length, Length) :- !. 

encode_words( [Word I Words] , LengthO, Length) :- 
Lengthl is LengthO + 1 , 
assert (word (Lengthl , Word) ) , 
encode_words (Words, Lengthl, Length). 



(End listing of file input.pl) 



A. 2 Deduction Engine Driver 

The main driver operates as per the discussion in Section |[ 

(Listing of file driver.pl) 

/* 

INFERENCE ENGINE 
*/ 
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7.7.7. parse (-Value) 

•/:/:/. ============= 

7.7.7. 

7.7.7. Value is the value corresponding to a final item 

7.7.7. generated by parsing a sentence typed in from the 

7.7.7. standard input . 



parse (Value) :- 
read_ input , 
init_chart , 
init_agenda(Agenda) , 
exhaust (Agenda) , 
f inal_item(Goal, Value), 
item_in_chart (Goal) . 



7. read a sentence 

7. init . to an empty chart 

7. init . agenda to include axioms 

7. process agenda until exhausted 

7. get form of final goal item 

7. find all such items in chart 



7.7.7. init_agenda(+Axioms , -Agenda) 
7.7.7. ============================= 

7.7.7. 

7.7.7. Add indices corresponding to each of the Axioms to 

7.7.7. an empty Agenda. 

init_agenda (Agenda) :- 

initial_items (Axioms) , 7. get axioms 

empty_agenda (Empty) , 

add_items_to_agenda (Axioms, Empty, Agenda). 

7.7.7. exhaust (+Agenda) 
7.7.7. ================ 

7.7.7. 

7.7.7. Generate all the consequences that follow from the 

7.7.7. indices in the Agenda. 

exhaust (Empty) : - 

is_empty_agenda (Empty) . 

exhaust (AgendaO) :- 

pop_agenda(AgendaO , Index, Agendal) , 
add_item_to_chart (Index) , 

add_consequences_to_agenda(Index, Agendal, Agenda), 
exhaust (Agenda) . 

7.7.7. add_consequences_to_agenda(+Index, +AgendaO, -Agenda) 
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Ill 

7,7,7, Add to AgendaO all the indices that follow 

7,7,7, immediately from Index, yielding Agenda. 

add_consequences_to_agenda(Index, AgendaO, Agenda) :- 
all_solutions (Consequence , 

consequence (Index, Consequence), 

Consequences) , 
add_items_to_agenda(Consequences, AgendaO, Agenda). 

7,7,7, consequence (Index, Consequent) 
7,7,1 ============================= 

7,7,1 

7,7,7, Consequent is the consequent of an inference rule 

7,7,7, whose antecedent is satisfied by the item given by 

7,7,7, Index and other items in the chart . 

consequence (Index, Consequent) :- 
index_to_item(Index, Trigger), 

matching_rule (Trigger, RuleName, Others, Consequent, 

SideConds) , 
items_in_chart (Others , Index), 
hold(SideConds) , 

notif y_consequence (RuleName, Trigger, Others, 
SideConds, Consequent). 

7,7,7, items_in_chart(+Items, +Index) 
17,1 ============================= 

17.1 

7,7,7, All the elements of Items, generated when processing 

7,7,7, Index from the agenda, are satisfied by stored items 

17,1 in the chart. 

items_in_chart ( [] , _Index) . 

items_in_chart ( [Antecedent I Antecedents] , Index) : - 
item_in_chart (Antecedent , Index) , 
items_in_chart (Antecedents , Index) . 



7.7,7. hold(+Conditions) 
7.7.7. ================= 

17.1 

7.7.7. All the side Conditions hold. 
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hoidcm . 

hold([Cond|Conds]) :- 
call(Cond) , 
hold(Conds) . 



VI'/, matching_rule (+Trigger , 

V/.'l, -RuleName, -Others, -Consequent, -SideConds) 

%VI, ========================================================== 

v/:i, 

°/,V/, Find an inference rule RuleName with antecedent of 

'/X/. the form U 9 [Trigger] Q V, where Others is U V, 

"/,'/,'/, Consequent is the consequent of the rule and 

7.7.7. SideConds are its side conditions. (0 denotes here 

7,7,'/, list concatenation) . 

mat ching_rule (Trigger , 

RuleName, Others, Consequent, SideConds) :- 
inference (RuleName , Antecedent, Consequent, SideConds), 
split (Trigger, Antecedent, Others). 

(End LISTING of file driver.pl) 



A. 3 Stored Items Comprising Chart and Agenda 



(Listing of file items.pl) 

/* 

STORED ITEMS 

*/ 

7,7,7, stored(Index, Item) 
7.7.7. =================== 

7.7.7. 

7,7,7, Predicate used to store agenda and chart Items in 

7,7,7, the Prolog database along with a unique identifying 

7,7,7, Index, assigned in numerical order. 

:- dynamic stored/2. 



7,7.7, key_index(Key , Index) 
7.7.7. ===================== 

7.7.7. 
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"Lll Predicate used to store an auxiliary indexing table 

7o7.7. for indexing stored items. The predicate 

'/,'/,'/, item_to_key/2 is used to compute the key for an 

•/.•/.•/. item. 

:- dynamic key _ index/2. 



Ill item_stored(+Item, -Index) 
in ========================== 

III 

III Finds a stored Item amd its Index in the sequence of 

Thl stored items. 

item_stored(Item, Index) :- 
item_to_key(Item, Key), 
key_index(Key, Index), 
stored(Index, Item). 



Ill similar_item(+Item, -Storedltem) 

in ================================ 

in 

°/>n Find a stored item Storedltem in the stored items 

'/.y.y, that might subsume Item. 

similar_item(Item, Similarltem) :- 
item_to_key (Item, Key) , 
key_index(Key, Indexof Similar) , 
stored(Indexof Similar , Similarltem) . 

'Lll subsumed_item(+Item) 

in ==================== 

III Item is subsumed by some stored item. 

subsumed_item(Item) :- 

similar_item(Item, Otherltem) , 
subsumes(0therltem, Item) . 

/* 

CHART and AGENDA 
*/ 

:- ensure_loaded ( chart ) . 
:- ensure_loaded ( agenda) . 
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(End listing of file items.pl) 



A. 3.1 Chart Items 



(Listing of file chart.pl) 

/* 

CHART 

*/ 

/.'/;/. init_chart 

III ================ 

III 

III Remove any bits of (agenda or) chart clauses and 

III associated keys from the Prolog database. 

init_chart : - 

retractall(stored(_,_)) , 
retractall(key_index(_,_)) . 



Ill item_in_chart (?Item, +Ref Index) 
III =============================== 

III 

III Retrieve a stored item matching Item. Ref Index is a 

III reference index that distinguishes items in the 

III chart (at or before the reference index) from those 

III in the agenda (after the reference index) . It is 

III the index of the item in the chart with the largest 

III index. 

item_in_chart (Item, Ref Index) :- 
item_stored(Item, Itemlndex) , 
(Itemlndex =< Ref Index 

17, Item is at or before reference, so it is in the 

11 chart 

-> true 

7,1 Item is after reference, so it AND ALL LATER 
11 ITEMS are in the agenda, so stop looking for 
17, other chart items . 
; !, fail). 



Ill item_in_chart(?Item) 
7.7.7. ==================== 
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7.7.7. 

7.7.7. Item is an item in the chart generated after agenda 

7.7.7. is exhausted (so there is no reference index 

'/o7.7. pointing to the end of the chart, and all stored 

7.7.7. items are chart items) . 

item_in_chart (Item) :- 

item_stored(Item, _) . 

7.7.7. add_item_to_chart (Index) 

'/:/'/. ======================== 

7.7.7. 

7.7.7. Add the item stored at Index in the stored items to 

7.7.7. the chart. (Nothing need be done, since moving on 

7.7.7. to the next agenda item changes the reference index, 

7.7.7. thereby implicitly making the item a chart item, so 

7.7.7. we just print debugging information.) 

add_item_to_chart (Index) :- 

notif y_chart_addition(Index) . 

(End listing of file chart.pl) 



A. 3. 2 Agenda Items 

The agenda items are just a contiguous subsequence of the stored items. The 
specification of which items are in the agenda (and therefore which are implicitly 
in the chart) is provided by the head and tail indices of the agenda subsequence. 
This queue specification of the agenda, packed into a term under the functor 
queue/2, is passed around as an argument by the deduction engine. A term 
queue (Head, Tail) represents a queue of agenda items where Head is the index 
of the first element in the queue and Tail is the index of the next element to 
be put in the queue (one more than the current last clement). 

Notice the asymmetry between enqueueing and dequeueing: enqueucing 
(add_item_to_agenda/3) takes explicit items, dequeueing (pop_agenda/3) pro- 
duces indices that may then be mapped to items by index_to_item/2. This 
is somewhat inelegant, but balances the need for abstraction in the generic al- 
gorithm with the efficiency of the main all_solutions there, which need not 
store the item whose consequences are being sought. 

This implementation is adequate because the items in the agenda always 
form a contiguous set of stored items, so their indices are sequential. 



(Listing of file agenda.pl) 
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/* 



AGENDA 



*/ 



7,7,7, is_empty_agenda (+Agenda) 
7,7.7, ======================== 

7.7,7, 

7,7,7, Holds if Agenda represents an empty agenda. 

is_empty_agenda(queue (Front , Back)) :- 
Front >= Back. 



7,7,7, empty_agenda(-Agenda) 
7,7.7, ===================== 

7.7.7. 

7.7,7, Agenda is a new empty agenda. 

empty_agenda (queue (0, 0)). 



7.7,7, pop_agenda(+Agenda, -Index, -NewAgenda) 
7.7.7. ====================================== 

7.7.7, 

7,7,7, Index is the top Index in the Agenda, NewAgenda is 

7,7,7, the Agenda with that item removed. 

pop_agenda (queue (Front , Back), 

Front , queue (NewFront , Back) ) : - 
Front < Back, 
NewFront is Front + 1 . 



7.7,7, add_item_to_agenda(+Item, +AgendaO, -Agenda) 
7.7.7, ============================================ 

7.7.7. 

7.7,7, Add the index corresponding to Item to AgendaO, 
7.7,7, yielding Agenda. This stores the appropriate items 

7,7,7, in the Prolog database. Note that the stored/2 

7.7,7, clause must be asserted at the end of the database 

7.7,7, (even though the index numbering provides ordering 

7.7,7, information already) to allow early cut off of 

7.7,7, searches in item_in_chart/2 (q.v.). 

add_item_to_agenda(Item, queue(Front, Back), 
queue (Front , NewBack) ) : - 
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notif y_agenda_addition(Item) , 

(\+ subsumed_item(Item) 

-> (assertz(stored(Back, Item)), 
item_to_key (Item, Key), 
assert (key_index (Key, Back)), 
NewBack is Back + 1) 

; NewBack = Back) . 



'/'/,'/, add_items_to_agenda(+Items , +AgendaO, -Agenda) 

7.7.7. ============================================== 

7.7.7. 

7,7,7, Add indices corresponding to all of the Items to 

7,7,7, AgendaO yielding Agenda. 

add_items_to_agenda( [] , Agenda, Agenda). 

add_items_to_agenda( [Item I Items] , AgendaO, Agenda) :- 
add_item_to_agenda(Item, AgendaO, Agendal) , 
add_items_to_agenda(Items , Agendal, Agenda). 



7,7,7, index_to_item( Index, Item) 
7.7.7. ========================== 

7.7.7. 

7,7,7, Item is the actual stored item for Index. 

index_to_item(Index, Item) :- 
stored(Index, Item) . 



(End listing of file agenda.pl) 



A. 4 Encoding of Deductive Parsing Systems 

We present the Prolog encodings of several of the deduction systems discussed 
above including all of the context-free systems from Section @ and the CCG 



system described in Section 4.2 



The deduction systems for context-free-based (definite clause) grammars all 
assume the same encoding of a grammar as a series of unit clauses of the fol- 
lowing forms: 

Grammar rules: Grammar rules are encoded as clauses of the form LHS > 

RHS where LHS (a nonterminal) and RHS (a list of nonterminals and preter- 
minals) are respectively the left- and right-hand side of a rule. 
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Lexicon: The lexicon is encoded as a relation between preterminals and termi- 
nals by unit clauses of the form lex(Term, Preterm), where Preterm 
is a preterminal (nonterminal dominating a terminal) that covers the 
Terminal. 

Start symbol: The start symbol of the grammar is encoded by a unit clause 
of the form startsymbol (Start) , where Start is a start nonterminal for 
the grammar; there may be several such nonterminals. 

Nonterminals and terminal symbols are encoded as arbitrary terms and 
constants. The distinction between nonterminals and terminals is implicit in 
whether or not the terms exist on the left-hand side of some rule. 



Parsing algorithms are specified as an inference system. 
This includes a definition of a class of items, and some 
inference rules over those items. Subsets corresponding to 
initial items and final items are also defined. 

The following predicates are used in defining an inference 
system: 

'/,'/,'/, initial_item(Item) Item is an initial item. 



(Listing of file inference.pl) 



/* 



INFERENCE RULES 



111 



111 



final_it em (Value) 



Value is the pertinent information 
to return about some final item. 



inference (RuleName 



Antecedent , 

Consequent, SideConditions) 



111 
111 
Ul 

ni 
ni 



Specifies an inference rule named RuleName with 
Antecedent items, a Consequent item, and some 
SideConditions . 



The following predicate is used to define appropriate 
indexing of the items : 



1'1'L item_to_key (+Item, -Key) 

vi:l ======================== 
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7.7.7. 

7.7.7. Key is a hash key to associate with the given Item. 
7.7,'/, The item will be stored in the Prolog database under 

7,7.7, that key. 

Definitions of these predicates can be found in the 

following files: 

*/ 

7, :- ensure_loaded( ' inf -top-down. pi' ) . 
7, :- ensure_loaded( ' inf -bottom-up. pi ' ) . 
:- ensure_loaded( ' inf -earley.pl') . 
'/,- ensure_loaded(' inf -ccg.pl') . 

7,7,7, initial_items (-Items) 
7,7.7, ===================== 

7,7.7. 

7.7.7. Items are the initial items of the inference system. 

initial_items (Items) :- 

all_solutions (Item, initial_item(Item) , Items). 

(End listing of file inference.pl) 



.4.1 The Top-Down System 



(Listing of file inf -top-down. pi) 

/* 

Parsing Algorithm Inference System 
Pure Top-Down Parsing 
*/ 



:- op(1200,xfx, >) . 



/* 

ITEM ENCODING 

*/ 
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7.7.7. item(AfterDot, I, Value) 
%'L'I. ======================== 

7.7.7. 

7.7.7. AfterDot is a list of nonterminals and terminals 

7.7.7. that need to be found from position I in the string 

7.7.7. to the end of the string. Value is some term that 
'/o7.7. is passed around among all the items to be returned 

7.7.7. as the final value associated with the parse. It is 

7.7.7. seeded to be the Start category of the parse, but 

7.7.7. may become further instantiated as the parse 
7.7.7. progresses. 

initial_item(item( [Start] , 0, Start)) :- 
startsymbol (Start) . 

f inal_item(item( [] , Length, Value), Value) :- 
sentencelength(Length) . 



/* 

ITEM INDEXING 

*/ 

7.7.7. item_to_key (+Item, -Index) 
7.7.7. ========================== 

7.7.7. 

7,7.7. Items are indexed by position and category of first 

7.7.7. constituent . 

item_to_key (item( [First I _] , I, _Value) , Index) :- 
First =. . [Firstcat I _] , 
hash_term(a(I,Firstcat) , Index). 

item_to_key (item( [] , I, _Value) , Index) :- 
hash_term(a(I ,none) , Index). 



/* 

INFERENCE RULES 

*/ 

7.7.7. 

7.7.7. SCANNER: 

inference ( scanner, 
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[ item ( [B I Beta] , I, Value) ], 
I 

item(Beta, II, Value), 

'/, where 

[II is I + 1, 
word(Il , Bterm) , 
lex(Bterm, B)] ). 

7.7.7. 

7.7.7. PREDICTOR: 

inference ( predictor, 

[ item ([A I Alpha] , I, Value) ], 
y, 

item(BetaAlpha, I, Value), 

'/, where 

[(A — > Beta) , 

append(Beta, Alpha, BetaAlpha)] ) . 



(End listing of file inf-top-down.pl) 



A.4.2 The Bottom-Up System 



(Listing of file inf -bottom-up. pi) 
/* 

Parsing Algorithm Inference System 
Pure Bottom-Up Parsing 
*/ 



:- op(1200,xfx, >) . 

/* 



ITEM ENCODING 

*/ 



7.7.7. item(BeforeDot, I) 
7.7.7. ================== 
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7.7.7. r 

7,7,7, BeforeDot is a list of nonterminals and terminals 

7,7,7, that have been found from the start of the string 

7,7,7, through position I . Note that the stack of parsed 

7,7,7, constituents is kept in reversed order, with the 

7,7,7, most recently parsed at the left edge of the list . 

initial_item(item( [] , 0)). 

final_item(item( [Value] , Length), Value) :- 
sentencelength(Length) , 
startsymbol (Value) . 



/* 

ITEM INDEXING 

*/ 

7,7,7, item_to_key (+Item, -Index) 
7,7.7. ========================== 

7.7.7. 

7.7.7. Items are indexed by position and category of first 

7.7.7. constituent . 

item_to_key(item( [First I _] , I), Index) :- 
First =.. [FirstcatlJ , 
hash_term(a(I,Firstcat) , Index). 

item_to_key(item( [] , I), Index) :- 
hash_term(a(I,none) , Index). 



/* 

INFERENCE RULES 

*/ 

7.7.7. 

7.7.7. SHIFT: 

inference ( shift, 

[ item(Beta, I) ] , 
t 

item ( [B I Beta] , II), 

7, where 

[II is I + 1, 
word(Il , Bterm) , 
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lex(Bterm, B)] 



). 



7.7.7, 

7.7.7, REDUCE: 

reduce , 

[ item(BetaAlpha, I) ] , 



it em ( [A I Alpha] , I), 

[(A > Beta) , 

reverse(Beta, BetaR) , 

append(BetaR, Alpha, BetaAlpha)] ). 

(End listing of file inf -bottom-up. pi) 



.4.3 The Earley's Algorithm System 



(Listing of file inf -earley.pl) 

/* 

Parsing Algorithm Inference System 

Earley's Algorithm 
*/ 



:- op(1200,xfx, >) . 

:- ensure_loaded(library (strings) ) . 



/* 

ITEM ENCODING 

*/ 

7,17. item(LHS, BeforeDot, AfterDot, I, J) 
7.7.7. ==================================== 

7.7.1 r 

7.7.7. LHS -> BeforeDot AfterDot r 

7,7.7, is a DCG production where BeforeDot is the 

7,7,7, reversal of BeforeDot 



inf erence( 
7, where 
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7.7.7. I, J are the two indices into the string 

initial_item(item('<start>' , [] , [Start], 0,0)) :- 
startsymbol (Start) . 

f inal_item(item( ' <start> ' , [Start], [] , 0, Length), 
Start) :- 
startsymbol (Start) , 
sentencelength(Length) . 



/* 

ITEM INDEXING 

*/ 

'/o7.7. item_to_key (+Item, -Key) 
7.7.7. ======================== 

7.7. Active edges are indexed by the category of the 

7.7. constituent after the dot and the starting position of 

7.7. that constituent. 

item_to_key(item(_A, _Alpha, [B|_Beta], _I, J), Hash) :- 
B = . . [Beat I _] , 
hash_term(a(J,Bcat) , Hash). 

7.7. Passive edges (with nothing after the dot) are indexed by 
7.7. the category of the parent constituent, and the starting 
7.7. position of that constituent. 
item_to_key(item(A, _Alpha, [] , I, _J) , Hash) :- 

A =. . [Acat I _] , 

hash_term(p(I,Acat) , Hash). 



/* 

INFERENCE RULES 

*/ 

7.7.7. 

7.7.7. SCANNER: 

inference ( scanner, 

[ item (A, Alpha, [B I Beta], I, J) ], 
I 

item(A, [B I Alpha], Beta, I, Jl) , 

7. where 

[Jl is J + 1, 



GO 



word( Jl , Bterm) , 

lex(Bterm, B)] ). 

7.7.7. 

7.7.7. PREDICTOR: 

inference ( predictor, 

[ item(_A, _Alpha, [B|_Beta], _I,J) ], 
I 

item(B, [] , Gamma, J, J), 

7. where 

[(B > Gamma)] ) . 

y.7.7. 

7.7.7. COMPLETOR : 

7.7.7. Type 1 and 2 Completor 
inference ( completor, 

[ item (A, Alpha, [B I Beta], I, J), 
item(B, _Gamma, [] , J,K) ] , 
% 

item(A, [B I Alpha], Beta, I,K), 

7. where 

□ ). 
(End listing of file inf -earley.pl) 



A. 4. 4 The Combinatory Categorial Grammar System 

The CCG parser in this section assumes an encoding of the CCG grammar/lexicon 
as unit clauses of the form lex (Word, Category) , where Word is a word in the 
lexicon and Category is a CCG category for that word. Categories are encoded 
as terms using the infix functors + for forward slash and - for backward slash. 
The start category is encoded as for context-free grammars above. 



(Listing of file inf-ccg.pl) 

/* 

Parsing Algorithm Inference System 

Bottom-Up Combinatory Categorial Grammar Parsing 
*/ 
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/* 



ENCODING 



*/ 



VI'/. item (Cat, I, J, Deriv) 

va ================================================= 

'I.VI. Cat is a CCG category 

VI.'/. I, J are the two indices into the string 

7.7.'/. Deriv is the derivation of the item 



initial_item(item(Cat , I , Jl , [Cat, Word])) :- 
word( Jl , Word) , 
lex (Word, Cat) , 
I is Jl - 1. 



f inal_item(item(Start ,0 , Length, D), D) :- 
startsymbol (Start) , 
sentencelength(Length) . 

/* 

ITEM INDEXING 

*/ 



7.7.7. item_hash(Item, Index) 
7.7.7. ====================== 

7.7. Temporarily disabled. 
item_hash(_Item, index) . 



/* 

INFERENCE RULES 

*/ 

7.7.7. 

7.7.7. FORWARD APPLICATION: 

inference( forward-application, 

[ item(X+Y, I, J, Dl) , item(Y, J, K, D2) ], 
y t 

item(X,I,K, [Dl, D2] ) , 

7. where 
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[] ). 



7.7.7. 

7.7.7. BACKWARD APPLICATION: 

inference( backward-application, 

[ item(Y, I, J, Dl) , item(X-Y, J, K, D2) ], 
t 

item(X,I,K, [Dl, D2] ) , 

7. where 

[] ). 

7.7.7. 

7.7.7. FORWARD COMPOSITION 1: 

inference ( f orward-compositionl , 

[ item(X+Y, I, J, Dl) , item(Y+Z, J, K, D2) ], 
t 

item(X+Z,I,K, [Dl, D2] ) , 

7. where 

[] ). 



7.7.7. 

7.7.7. FORWARD COMPOSITION 2: 

inference ( f orward-compositionl , 

[ item(X+Y, I, J, Dl) , item(Y-Z, J, K, D2) ], 
y> 

item(X-Z,I,K, [Dl, D2] ) , 

7. where 

[] ). 



7.7.7. 

7.7.7. BACKWARD COMPOSITION 1: 

inference ( backward-compositionl , 

[ item(Y+Z, I, J, Dl) , item(X-Y, J, K, D2) ], 
% 

item(X+Z,I,K, [Dl, D2] ) , 

7. where 

[] ). 



7.7.7. 

7.7.7. BACKWARD COMPOSITION 2: 
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inference ( backward-composition2 , 

[ item(Y-Z, I, J, Dl) , item(X-Y, J, K, D2) ], 
y> 

itemCX-Z.I.K, [Dl, D2] ) , 

7 where 

[] ). 



(End listing of file inf -ccg.pl) 



A. 5 Sample Grammars 



(Listing of file grammars.pl) 
/* 



SAMPLE GRAMMARS 

*/ 



: - ensure_loaded ( ' gram-dcg . pi ' ) . 
'/,:- ensure_loaded( ' gram-ccg.pl ' ) , 



(End listing of file grammars . pi) 



A. 5.1 A Sample Definite-Clause Grammar 

The following is the definite-clause grammar of Figure encoded as per Sec- 



tion 



A.4 



(Listing of file gram-dcg.pl) 

s(s(NP,VP)) — > [np(NP), vp(VP)]. 

np(np(Det,N,Rel)) > 

[det(Det), n(N) , optrel(Rel)] . 
np(np(PN)) — > [pn(PN)]. 
vp(vp(TV,NP)) — > [tv(TV), np(NP)]. 
vp(vp(IV)) — > [iv(IV)] . 

optrel(rel(that,VP)) > [relpro, vp(VP)] . 

7optrel(rel(epsilon) ) > [] . 
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lex(that, relpro) . 
lex(terry, pn(pn(terry) ) ) . 
lex ( shrdlu , pn (pn ( shrdlu) ) ) . 
lex(halts, iv(iv(halts) ) ) . 
lex(a, det(det(a))) . 
lex(program, n (n (program) )) . 
lex(writes, tv(tv(writes) ) ) . 

startsymbol(s (_) ) . 

(End LISTING of file gram-dcg.pl) 



A. 5. 2 A Sample Combinatory Categorial Grammar 

The following is the combinatory categorial grammar of Figure ^ encoded 
propriately for the CCG deduction system. 



(Listing of file gram-ccg.pl) 

lex ( john, np) . 

lex (bananas, np) . 

lex(likes, (s-np)+np) . 

lex(really, ( (s-np)+(s-np) ) ) . 

startsymbol(s) . 

(End listing of file gram-ccg.pl) 

A.6 Utilities 

(Listing of file utilities.pl) 
/* 

UTILITIES 

*/ 

7.YX subsumes (+General , +Specific) 
y.7.7. ============================= 

m, 

7°/>h Holds if General subsumes Specific. 
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7,7,7, Note that Quintus Prolog 3.x has a subsumes_chk/2 that 
7,7,7, could replace subsumes/2. The explicit implementation 
7,7,7, is left here to illustrate this standard Prolog idiom 
7,7,7, for subsumption testing. 

subsumes (General , Specific) :- 

\+ \+ ( make_ground(Specif ic) , 
General = Specific ) . 

7,7,7, make_ground(Term) 
7,7.7, ================= 

7,7,7, 

7,7,7, Instantiates all variables in Term to fresh constants. 

make_ground(Term) :- 

numbervars(Term, 0, _) . 



7,7,7, all_solutions(Term, Goal, Solutions) 
7,7.7. ==================================== 

7.7.7, 

7.7.7. Solutions is a list of instances of Term such that 

7.7.7. Goal holds . All free variables in Goal are taken to 

7.7.7. be existentially quantified. Solutions may be the 
7.7.7. empty list if there are no solutions . 

7.7.7. This implementation relies on the details of findall in 
7.7.7. Quintus Prolog. It could be reimplemented using the 
7.7.7. more standard built-in predicate setof/3 as follows: 
7.7.7. 

7.7.7. all_solutions(Var, Goal, Solutions) :- 
'/.'/.'/. setof(Var, Goal~Goal, Solutions). 

all_solutions(Var, Goal, Solutions) :- 
findall(Var, Goal, Solutions). 



7.7.7. split (Elem, List, Rest) 
7.7.7. ======================= 

7.7.7. 

7.7.7. List = U <§ [Elem] V and Rest = U V. 

split(Term, [TermlRest], Rest). 
split(Term, [First I RestO] , [First I Rest] ) :- 
split(Term, RestO, Rest). 
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(End listing of file utilities.pl) 



A. 7 Monitoring and Debugging 



(Listing of file monitor.pl) 

/* 

MONITORING 

*/ 



'/,'/,'/, verbose 
va ======= 

y.7.7. 

'I'I'L Predicate governs the degree of verbosity in the 
'I'I'L notifications. 

:- dynamic verbose/0. 

I'L'L notify,. . .(...) 
t'L'L =============== 

•/.•/.•/. 

"L'L'L Prints debugging information if the flag verbose/0 
v.'/.'/. is true. 

notify_consequence(RuleName, Trigger, Others, 
SideConds, Consequent) :- 

( verbose -> 

formatC p: n trigger: p n" , 
[RuleName , Trigger] ) , 
formatC others: p n" , [Others]), 

formatC side conds : p n" , [SideConds]), 
formatC cons: p n", [Consequent]) 

; true ) . 



notif y_agenda_addition(Item) :- 
(verbose 

-> (formatC NAdding to agenda: <-> p n' , [Item])) 
; (print ('.'), ttyflush)). 

notif y_chart_addition(Index) :- 
index_to_item(Index, Item), 
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item_to_key (Item, Key) , 
(verbose 

-> (formatC NAdding to chart: < p> p n' , 

[Key, Item])) 
; (print (':'), ttyflush)). 

(End listing of file monitor.pl) 
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